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SEQUENCE ENCODING MAMMALIAN SULFONYLUREA RECEPTOR 
AND METHOD OF DETECTING PERSISTENT HYPERINSULINEMIC 
HYPOGLYCEMIA OF INFANCY 
REFERENCE TO RELATED APPLICATIONS 

5 This application is a continuation-in-part of U.S. 

patent application Serial No. 08/226,972, filed April 13, 
1994, the disclosure of which is hereby incorporated by 
reference in its entirety. 
REFERENCE TO GOVERNMENT GRANTS 

10 This work was supported in part by research grants 

from the National Institutes of Health, grant number NIH 
R01DK41898 and R01DK44311. The United States Government may 
have certain rights in this invention. 
BACKGROUND OF THE INVENTION 

15 Sulfonylureas are oral hypoglycemics widely used in 

the treatment of Non-Insulin Dependent Diabetes Mellitus 
(NIDDM) . They enter the bloodstream, bind with high affinity 
to a pancreatic /3-cell plasma membrane protein termed the 
sulfonylurea receptor, and stimulate insulin release. The 

2 0 mechanism of stimulation is thought to be through inhibition 
of an ATP-sensitive K+ channel (K ATP ) , a key protein which 
sets the /S-cell resting membrane potential (Ashcroft, et al. 
Cell. Signal . 1990, 2, 197-214, all references cited herein 
are incorporated by reference in their entirety) . A 

2 5 reduction in potassium outflow causes depolarization of the 

plasma membrane, activation of L-type voltage-dependent 
calcium channels (VDCCs) , and increased cytosolic calcium. 
This triggers insulin release by as yet unknown mechanisms 
(Rajan, et al. Diabetes Care 1990, 13, 340-363). In NIDDM 

3 0 patients on sulfonylureas, the consequent reduction in blood 

glucose to more normal levels is thought to be critical in 
controlling the disease (Gerich, J.E. New Engl, J* Med. 1989, 
321, 1231-1245) . 
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The biochemistry of the sulfonylurea receptor (SUR) 
(Ashcroft et al Biochem. Biophys Acta 1992, 1175, 45-49 and 
Panten et al, Horm* Metab ♦ Res. 1992, 24 , 549-554) is 
consistent with the electrophysiology of the 0~cell K ATP 
5 channel. The endogenous regulators of channel activity 

include cytosolic nucleotides (ATP and Mg-ADP) and possibly 
phosphorylation* In the absence of cytosolic nucleotides, 
sulfonylureas weakly inhibit channel activity (Schwans teener 
et al* Br. J. Pharmacol 1992, 107, 87-94). When channels are 

10 activated by Mg-ADP, inhibition by ATP is strongly promoted 
by the presence of sulfonylureas. These results are 
interpreted as evidence that simultaneous occupancy of two 
nucleotide binding sites is required for effective channel 
inhibition by the sulfonylureas. The reported allosteric 

15 interactions correlate well with evidence that the brain 

receptor has two nucleotide binding sites (de Weille, et al. 
J . Biol. Chem 1992, 267, 4557-4563) physically located on 
the same polypeptide chain as the sulfonylurea binding site 
(Bernardi et al. Biochemistry 1992, 31, 6328-6332). One 

20 binding site appears to be specific for ATP , and is proposed 
to be the same site at which micromolar concentrations of ATP 
inhibit the K ATP channel. A second site has high affinity for 
Mg-ADP , with occupancy at this site promoting channel 
opening. Absolute concentrations of ATP and ADP in the cell 

25 are thought to regulate channel activity in a straightforward 
fashion (Hopkins et al. J. Membrane Biol. 19 92, 129, 287- 
295) . High ATP concentrations as a result of high serum 
glucose levels close the channel, stimulating insulin 
secretion. Reduced glucose levels increase intracellular ADP 

30 concentrations, and thereby increase the open channel 
probability, and decrease insulin secretion. 

Although sulfonylureas, particularly tolbutamide 
and more potent second generation drugs like glyburide and 
glipizide, are considered to be relatively specific ^ 

35 inhibitors of the K ATP channel, the exact relationship between 
the sulfonylurea receptor and the K ATP channel is not clear 
(Nichols et al. Am. J. Physiol. 1991, 261, H1675-H1686, 
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Takano et al. Progress in Neurobiology 1993, 41, 21-30, and 
Edwards et al. Annu . Rev. Pharmacol. Toxicol. 1993, 33, 597- 
637). m the insulin-secreting CRI-G1 cell line, the 
addition of glyburide, or tolbutamide to inside-out plasma 
5 membrane patches inhibits the K ATp channel (Khan et al. Proc 
*. Soc. Lond. B. 1993, 253, 225-231), intimating direct 
interactions between sulfonylureas and the channel protein 
in another insulin secreting cell line, CRI-D11 cells 
however, the loss of sulfonylurea binding sites with the 
10 retention of K ATP activity suggests these two activities may 
uncouple and reside on separate, transiently bound subunits 
(Khan et al. Proc . R. soc . Lond. B. 1993, 253, 225-231). 
Similarly, in other cell and tissue types, sulfonylurea 
binding and channel activity may be uncoupled (Ashford et al 
15 Br. J. Pharmac. 1990, 101, 531-540). A technique is not 
currently available to assess whether K ATP activity resides 
within the same polypeptide containing the putative 
nucleotide and sulfonylurea binding sites, or on separate 
loosely, or tightly bound subunits. 
20 A P re vious attempt to purify the receptor from 

hamster insulin-secreting tumor (HIT) cells was limited by 
the low abundance of the receptor and the presence of a more 
abundant co-purifying protein. Aguilar-Bryan, L. , et al . , 
JBC, 1990, 265, 8218. 

25 The sulfonylurea receptor is the target for drugs 

used in the treatment of type II diabetes (non-insulin 
diabetes mellitus) . This association has suggested it plays 
a role in the regulation of insulin secretion by glucose and 
makes the sulfonylurea receptor a potential diabetes 

30 candidate gene. 

Persistent hyperinsulinemic hypoglycemia of infancy 
(PHHI) is an autosomal recessive disorder of glucose 
homeostasis characterized by unregulated secretion of insulin 
and profound hypoglycemia. A. Aynsley-Green et al., Arch. 
35 Dis. Child. 1981, 56, 496. The pathophysiology of this 
disease remains obscure, but in vitro studies suggest a 
defect of glucose-regulated insulin secretion in pancreatic 
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islet /S~cells. Aynsley-Green et al., supra,, N. Kaiser et 
al., Diabetologia 1990, 33, 482. The incidence of PHHI has 
been estimated at 1/50,000 live births in a randomly mating 
population. G.J. Bruining, Curr. Opin. Pediatr. 1990, 2, 
5 758. However, in a Saudi Arabian population in which 51% of 
births occurred to parents who were first or second cousins, 
the incidence has been established as 1/2675 live births. 
P.M. Mathew et al., Clin. Pediatr. 1988, 27, 148. Recently, 
the PHHI gene was assigned to chromosome llpl415.1 by linkage 

10 analysis. B. Glaser et al., Mature Genet. 1994, 7, 185 and 
P.M. Thomas, G*J. Cote, D.M. Hallman, P.M. Mathew, Am. J. 
Hum. Genet. 1995, 56, 416-421. Candidate genes for this 
disorder include those involved in the 0-cell glucose sensing 
mechanism and insulin secretion. Localization of PHHI to 

15 chromosome lip excluded previously mapped genes involved in 
£-cell function. Considered as a candidate was the newly 
cloned high-affinity SUR gene, a member of the ATP-binding 
cassette superfamily, and a putative subunit of the modulator 
of insulin secretion, the 0-cell ATP-sensitive potassium 

20 channel (K ATf >) . S.J. Ashcroft and F. M. Ashcroft, Biochimica 
et Biophysica Acta, 1992, 1175, 45; U. Panten, M. 
Schwanstecher , and C. Schwanstecher , Horm, Metab . Res. 1992, 
24, 54 9. The methods of the present invention map the 
sulfonylurea receptor to the same chromosomal location as 

2 5 PHHI and provide evidence that mutations in the sulfonylurea 

receptor are the cause of PHHI. 

Accordingly, there remains a need to identify 
sulfonylurea receptor and sequences encoding sulfonylurea 
receptor which will provide: 

3 0 1. a correlation between sulfonylurea receptor 

and one or more forms of diabetes, 

2. a sequence to purify human sulfonylurea 
receptors, 

3. an isolated sulfonylurea receptor, prepared by 
35 recombinant methods, 
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4. polyclonal and monoclonal antibodies and 
methods of preparing the same against sulfonylurea receptor, 



5, 



20 



information as to whether this receptor-ion 
channel family involves multi-subunits within each channel 
5 for channel activity, 

6. gene therapy such that sequences which encode 
mutant sulfonylurea receptors are replaced by wild type 
sulfonylurea receptor sequences, 

7. a method of screening to identify drugs which 
10 react with and bind to the sulfonylurea receptor, 

8. non-human transgenic animals to study diabetes 
and PHHI, and the physiologic effects of varying levels of 
sulfonylurea receptor, by using an inducible promoter to 
regulate the expression of the sulfonylurea receptor, for 

15 example , and 

9- probes, including PCR probes, for diagnosing 
conditions associated with the expression of a specific 
sulfonylurea receptor allele. 

The present invention reveals that the sequence 
encoding the mammalian sulfonylurea receptor maps to the 
sequence encoding persistent hyperinsulinemic hypoglycemia of 
infancy. 

SUMMARY OF THE INVENTION 

The present invention provides sequences encoding a 
25 sulfonylurea receptor. Nucleic acid sequences, SEQ ID NO: i, 
4, 6, 26, 27, 32, 33, 35, and 36 are cDNA sequences to which' 
the present invention is directed. The nucleic acid 
sequences, SEQ ID NOS: 2 and 3 , may be added to the 5' end 
SEQ ID NO: l to furnish a sequence encoding a sulfonylurea 
30 receptor. SEQ ID NOS: 32, 33, 35 and 36 are rodent sequences 
(SEQ ID NOS: 32 and 33 - rat, SEQ ID NOS: 34 and 35 - 
hamster) encoding sulfonylurea receptor which functionally 
bind sulfonylurea. SEQ ID NOS: 26 and 27 are human sequences 
which encode sulfonylurea receptor. SEQ ID NOS: 27, 33, and 
3 5 3 6 set forth DNA sequences translated into amino acid 
sequences, which set forth below the DNA sequence. 
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A further aspect of the present invention provides 
sulfonylurea receptor polypeptides and/or proteins. SEQ ID 
NOS: 4, 5, 6, 7, 26, and 28 are novel polypeptides of the 
invention produced from nucleotide sequences encoding rat 
5 (SEQ ID NOS: 4, 5, and 34), hamster (SEQ ID NOS: 6, 7, and 
37), and human (SEQ ID NOS: 26 and 28) sulfonylurea receptor, 
respectively. Also within the scope of the present invention 
is a purified sulfonylurea receptor. 

The present invention also provides nucleic acid 

10 sequences encoding a sulfonylurea receptor, expression 
vectors comprising a nucleic acid sequence encoding a 
sulfonylurea receptor, transformed host cells capable of 
expressing a nucleic acid sequence encoding a sulfonylurea 
receptor, cell cultures capable of expressing a sulfonylurea 

15 receptor, and protein preparations comprising a sulfonylurea 
receptor . 

A method of detecting persistent hyper insulinemic 
hypoglycemia of infancy comprising obtaining a sample 
comprising patient nucleic acids from a patient tissue 

20 sample; amplifying sulfonylurea receptor specific nucleic 
acids from said patient nucleic acids to produce a test 
fragment; obtaining a sample comprising control nucleic acids 
from a control tissue sample; amplifying control nucleic 
acids encoding wild type sulfonylurea receptor to produce a 

25 control fragment; comparing the test fragment with the 
control fragment to detect the presence of a sequence 
difference in the test fragment, wherein a difference in said 
test fragment indicates persistent hyperinsulinemic 
hypoglycemia of infancy is also an embodiment of the present 

30 invention. 

Other methods of the present invention include a 
method of detecting persistent hyperinsulinemic hypoglycemia 
of infancy comprising obtaining a sample comprising patient 
mRNA from a patient tissue sample; reverse transcribing said 

3 5 mRNA into cDNA to produce patient cDNA; amplifying 

sulfonylurea receptor specific cDNA from said patient cDNA to 
produce amplified patient cDNA ; obtaining a sample comprising 
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control nucleic acids from a control tissue sample; 
amplifying control DNA encoding wild type sulfonylurea 
receptor to produce control cDNA ; digesting said test 
fragment and said control fragment with a selected 
5 endonuclease; and comparing the test fragment to the control 
fragment, wherein said test fragment indicates persistent 
hyperinsulinemic hypoglycemia of infancy. 

Another embodiment of the present invention is a 
diagnostic kit for detecting persistent hyperinsulinemic 

10 hypoglycemia of infancy comprising in one or more containers 
a pair of primers, wherein one primer within said pair is 
complementary to a region of the sulfonylurea receptor, 
wherein one of said pair of primers is selected from the 
group consisting of SEQ ID NOS: 16-24, a probe specific to 

15 the amplified product, and a means for visualizing amplified 
DNA, such as and not limited to fluorescent stain, 32 P, and 
biotin, and optionally including one or more size markers, 
positive and negative controls, and restriction 
endonucleases . 

2 0 Still another embodiment of the present invention 

includes the primer sequences identified in SEQ ID NOS: 16- 
24 . 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 displays characteristics of the purified 
25 HIT cell receptor. The radiolabeled receptor (lanes 1 and 3) 
cleaved with endoglycosidase F/N-glycosidase F (endo F) , 
increases the mobility of the protein by approximately 3 kDa 
(lane 2) . Subsequent partial V8 protease digestion (lanes 4 
and 6) yielded radiolabeled fragments that also shift 
30 mobility with endo F treatment (lane 5). Each of these 
species has the same N-terminal sequence (left side of 
figure) , except that receptor deglycosylation results in an 
Asp at residue 9 * 

Figure 2A shows that antibodies against residues 
35 10-20 specifically recognize the 140 kDa polypeptide. 

purified 140 kDa polypeptide was electrophoresed on a single 
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lane of a 6% SDS gel and transferred to Immobilon P. The 
Immobilon P was placed in a miniblotter and the lanes 
incubated as follows: Lane 1 - Preimmune serum. Lane 2 - 
Immune serum. Lane 3 -Immune serum + immunogen. Lane 4 - 
5 immune serum + irrelevant MAP peptide. The filter was 

further incubated with a second antibody (goat anti-rabbit 
conjugated to alkaline phosphatase) and developed with the 
appropriate substrates. Figure 2B displays antibodies which 
recognize a polypeptide with the appropriate mobility shift 

10 following Endo F treatment. Purified receptor (lane 1) was 
incubated for 3 0 min at 3 7 °c in the presence (lane 2) f or 
absence (lane 3) of endoglycosidase F/N-glycosidase F, 
incubated with first (anti-KAP 10-20) and second antibody, 
and developed with substrate. The bottom panel shows the 

15 autoradiogram of the immunoblot in the top panel. Figure 2C 
shows antibodies which immunoprecipitate the photolabeled 140 
kDa receptor, HIT cell membranes were incubated with 125 I- 
labeled iodoglyburide, photolabeled, solubilized with 1% 
digitonin, centrifuged at 100,000 x g and the supernatant 

2 0 (lane 1) incubated with preimmune serum (lane 2) , immune 
serum (lane 3), immune serum + anti-MAP 10-20 (lane 4) and 
immune serum + irrelevant MAP peptide (lane 5). Samples were 
co-incubated with protein A-Sepharose, the beads washed with 
buffer, heated in the presence of pH 9 sample buffer, 

2 5 electrophoresed on a 6% polyacrylamide SDS gel, and an 

autoradiogram prepared. Results using antibodies against 
receptor residues 1-8 were the same as those using antibodies 
against residues 10-20, 

Multiple antigenic peptides (MAPS) were synthesized 
30 (Posnett et al. J. Biol. Chem. 1988 263:1719-1725) and 
polyclonal antibodies generated in rabbits produced by 
standard methods (Antibodies: A Laboratory Manual Cold Spring 
Harbor Laboratory 1988) . Interdermal injections of l mg of 
antigen were spaced 2-3 weeks apart, and contained complete, 

3 5 or incomplete Freund's adjuvant 

Figure 3 shows the nucleotide sequence of cDNAs 
encoding the high affinity sulfonylurea receptors from rat 
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and hamster together with the deduced amino acid sequences. 
DNA sequencing was performed by the dideoxy chain termination 
method with 35 S-labeled nucleotides. Numbers on the left of 
each column indicate the nucleotide positions; numbers on the 
5 right of each column indicate the amino acid positions. The 
amino acid sequence of four peptides determined by chemical 
methods are marked in bold italics* The putative ATP-binding 
domains are shown underlined and the Walker (Walker et al. 
EHBO Jour. 1982 1:945-951) "A" and "B" consensus sequences 

10 are boxed in dotted lines. The Walker A sequence, GXXGXGK 

(SEQ ID NO: 14) and B sequence, xVx 1 D (SEQ ID NO: 15) (where 
X 1 is a hydrophobic amino acid) are part of a common 
nucleotide binding fold found in a number of nucleotide 
binding proteins. The G residues in the A sequence interact 

15 with a phosphate in the nucleotide. Possible sites of 

phosphorylation by protein kinase A (closed diamonds) and C 
(open diamonds) are indicated. Potential transmembrane 
spanning helices assigned using the algorithm of Eisenberg et 
al* (Eisenberg/ et al. J. Hoi. Biol. 1984 179:125) are boxed. 

20 The nucleotide sequence and first row of amino acid sequence 
are for the rat; the amino acid sequence differences are 
given in the second row for the hamster. A consensus poly A 
addition site is indicate in bold at the 3' end of the cDNA . 
Abbreviations for the amino acid residues are: A, ala; C, 

25 cys; D, asp; E, glu; F, phe; G, gly; H, his; I, ile; K, lys; 
L, leu; M, met; N, asn; P, pro; Q, gin; R, arg; S, ser, T, 
thr; V, val; W, trp; and Y , tyr. 

Figure 4 displays the deduced amino acid sequence 
of the sulfonylurea receptor aligned with the amino acid 

30 sequence of a MRP, multidrug resistance-associated protein, 
(dvhuar) . The alignment was generated with PILEUP from the 
Genetics Computer Group package (Madison, Wl) (Genetics 
Computer Group Program Manual for the GCG Package Version 7 
1991) using a modification of the algorithm of Feng and 

35 Doolittle (Feng, et al. J . Mol . Evol , 1987 25:351). 

Figure 5 exhibits the sequence comparison of 
nucleotide binding folds. The alignment was generated using 
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the PILEUP program as described in Figure 4. The Walker A 
and B consensus sites are given in boldface. Upper case 
letters indicate a match to the consensus sequence generated 
using the pretty program (Genetics Computer Group, Madison, 
5 WI) . 

Figure 6 is a northern blot of total RNA from a- 
and j3-cell lines hybridized with a 2.2 kb EcoRI-Xhol fragment 
of the sulfonylurea receptor. Approximately 10 tig of RNA 
from (A) aTC-6 cells, (B) HIT cells, (C) RIN cells and (D) 
10 mouse liver was analyzed using standard procedures (Ausubel 
et al. Current Protocols in Mol . Biol. 1994). The estimated 
size of the major component is approximately 5000 
nucleotides. 

Figure 7 displays a hydrophobicity profile of the 

15 Rat Sulfonylurea receptor. Hydrophobicity values were 

determined according to Kyte and Doolittle (Kyte et al. J". 
Mol. Biol. 1982 157:105-132) for 11-residue peptides and are 
plotted versus the amino acid number. The bars marked A and 
B are over the Walker A and B consensus sequences Walker et 

20 al. EMBO Jour. 1982 1:945-951). 

Figure 8 shows a schematic model of the high 
affinity sulfonylurea receptor. The Walker A and B sites are 
marked within the two nucleotide binding folds. Based on the 
hydrophobicity and hydrophobic moment data there are nine 

25 transmembrane spanning domains before the first nucleotide 
binding fold and four transmembrane spanning domains between 
the two folds* The branched structure at the N-terminus of 
the mature receptor symbolizes glycosylation. 

Figure 9A reveals the results of in vitro 

30 translation of mRNA transcribed from the rat sulfonylurea 
cDNA. The cDNA was subcloned into pGEM4 (Promega, Inc., 
Madison, WI) and transcribed using the SP6 promoter and SP6 
RNA polymerase following the manufacturer's directions. RNA 
was translated in rabbit reticulocyte lysate (Promega, Inc.) 

35 

35 following the manufacturer's recommendations for S- 

methionine. Lane 1 is the HIT cell photolabeled receptor as 
a marker, lane 2 is the in vitro translation product 



BNSDOCID: <WO 952841 1A1J_> 



WO 95/28411 



PCT/US95/04463 



15 



- 11 - 

resulting from addition of receptor mRNA and lane 3 is the 
result of no added RNA. The arrow marks the 140 kDa protein. 

Figure 9B displays a gel of the results of 
immunoprecipitation of the RIN cell sulfonylurea receptors 
5 with polyclonal antibodies directed against a nucleotide 

binding fold domain (NBF). Lane l; 140 and 150 kDa receptors 
from soluble RIN cell membrane proteins, lane 2; 
immunoprecipitation with preimmune serum, lane 3; immune 
serum from rabbit immunized with NBF2 , lane 4; immune serum + 
10 NBF2 fusion protein. Sulfonylurea receptor cDNA regions 
encoding the NBF2 domain was subcloned in frame into pMALc 
and expression of the proteins fused with maltose binding 
protein induced in E. coli. Fusion proteins were purified by 
electrophoresis and electroelution, and 200 M g amounts, with 
complete, or incomplete Freund's adjuvant, injected 
interdermally into rabbits using a standard 2-3 week regimen 
of bleeding and boosting. 

Figure 10 displays the genomic organization and 
CDNA sequence of the human sulfonylurea receptor (SUR) 
homologue in the second nucleotide binding fragment region 
(NBF— 2) . The sequence encoding NBF- 2 is located within SEQ 
ID NO: 26, nucleic acid positions 524 to 1048. Solid 
rectangles represent exons which are labeled a-0 for 
identification. The numbers between rectangles represent 
25 intronic sizes. Primers used in mutational analysis are 
diagrammed and listed in SEQ ID NOS: 16-24. 

Figure 11A-D display the exon mutation in the SUR 
NBF-2. Figures llA is a schematic representation of NBF— 2 
exons x, 6 illustrating the normal (upper) and mutant 
30 (lower) RNA splicing patterns. Figure 11B displays the 

sequence of a pancreatic cDNA product, from an affected child 
of Family 6, demonstrating the exon skipping event. Skipping 
of exon X results in a 109 bp deletion in the mRNA 
transcript, a frame shift and inclusion of a premature stop 
35 codon. Single upper case letters represent amino acids. 
Figure lie shows the sequence of genomic DNA from the 
affected patient in Figure HB which reveals a G to A point 
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mutation at the 3' end of the exon, which exon is excised in 
mRNA, as compared to a normal sample of genomic DNA. Exonic 
sequence is in upper case and intronic sequence in lower case 
letters. Figure 11D shows Mspl restriction enzyme analysis 
5 of PCR-amplif ied genomic DNA from members of Family 6, 
indicating affected individuals. The G to A mutation 
destroyed a restriction site for Mspl {C/CGG) . Normal PCR 
product is digested into 3 04 bp, 85 bp, and 38 bp fragments, 
while that containing the mutation is digested into 3 04 bp 

10 and 123 bp fragments. MW is 100 bp ladder (GIBCO-BRL, 
Gaithersburg, Maryland) , UC is an uncut sample, C is a 
control PCR reaction lacking template. 

Figure 12A-D reveal a mutation in the intron 
preceding NBF-2 exon a, which activates cryptic 3' splice 

15 site usage. Figure 12A displays the sequence of genomic DNA 
from an affected member of Family 4 which revealed a G to A 
mutation in the splice site preceding the first exon of the 
NBF-2. Figure 12B shows Neil restriction enzyme analysis of 
genomic DNA from members of Family 4 r indicating affected 

20 individuals. The G to A mutation destroys a restriction site 
for Neil (CC/ (G/C) GG) . Normal PCR product is digested into 
71 bp and 75 bp fragments, while that containing the mutant 
sequence is not cut. By previous haplotype analysis, the 
unaffected sibling in this family had two wild type alleles, 

25 P.M. Thomas, G.J. Cote, D.M. Hallman, P.M. Mathew, Am. J. 
Hum. Genet., supra. Figure 12C illustrates the constructs 
used to examine RNA processing of exons within NBF-2* Solid 
rectangles and thin lines represent human SUR gene exonic and 
intronic sequences, respectively. The unmarked solid 

3 0 rectangle represents a portion of the exon which is 5' to 

exon a of the NBF-2 region. That labeled RSV represents the 
enhancer and promoter isolated from the rous sarcoma virus 
long terminal repeat. The thick line represents an intronic 
sequence derived from vector and the human metallothionine 

35 IIA gene, which also contains polyadenylation signals. 
Normal and mutant RNA splicing patterns, including the 
location of the three cryptic splice sites, are diagrammed in 
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the lower portion* The open triangle marks the position of 
the mutated base within the splice site. Figure 12D shows 
PCR amplification across splice site of normal (N) and mutant 
(M) cDNA transcripts, isolated 48 hours after transfection 
5 with the splicing constructs. Subcloning and sequencing of 
these products revealed their identity as diagrammed in 
Figure 12C. The control (C) represents cDNA amplified from 
untransf ected cells. 

DETAILED DESCRIPTION OF THE INVENTION 

10 The present invention is directed to the nucleic 

acid and protein sequences encoding a sulfonylurea receptor. 
The present invention provides a nucleotide sequence of a 
sulfonylurea receptor, Figure 3, and SEQ ID NOS: 1, 4, 6, 33, 
and 36* Novel polypeptide sequences, SEQ ID NOS: 4, 5, 6, 

15 7, 34, and 37 coding for a sulfonylurea receptor are also 
included in the present invention- SEQ ID NOS: 26, 27, and 
2 8 provide the nucleic acid and amino acid sequences of the 
last 11 exons of the 3' end of human sulfonylurea receptor, 
hereinafter referred to, together with the rodent sequences 

20 for sulfonylurea receptor, as sequence for the sulfonylurea 
receptor . 

SEQ ID NO: 1 provides the cDNA sequence of a rodent 
sulfonylurea receptor. SEQ ID NOS: 2 6 and 27 provide the 
human genomic DNA sequence of sulfonylurea receptor. Nucleic 

25 acids within in the scope of the present invention include 
cDNA, RNA, genomic DNA, sequences within these larger 
sequences, antisense oligonucleotides. Sequences encoding 
the sulfonylurea receptor also include amino acid, 
polypeptide, and protein sequences. Variations in the 

3 0 nucleic acid and polypeptide sequences of the present 

invention are within the scope of the present invention and 
include N terminal and C terminal extensions, transcription 
and translation modifications, and modifications in the cDNA 
sequence to facilitate and improve transcription and 

35 translation efficiency. In addition, mismatches within the 
sequences identified herein, which achieve the methods of the 
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invention, such that the mismatched sequences are 
substantially complementary to the sulfonylurea receptor 
sequences identified, are also considered within the scope of 
the present invention. Mismatches which permit substantial 
5 complementarity to the sulfonylurea receptor sequences, such 
as similarity in residues in hydrophobicity , will be known to 
those of skill in the art once armed with the present 
disclosure. In addition, the sequences of the present 
invention may be natural or synthetic. 

10 SEQ ID NOS: 2 and 3 are rat and hamster sequences, 

respectively, which may be added to SEQ ID NO: 1 at the 5' 
end preceding the coding region of the sulfonylurea gene. 

The cDNA and amino acid sequences identified in 
Figure 3 indicate several amino acid residue differences 

15 between rat and hamster sulfonylurea receptor. These 

differences are indicated in the corresponding nucleic acid 
sequence of SEQ ID NO: 1 by an "N" in the positions which 
result in different amino acids. Accordingly, "N" appears in 
SEQ ID NO: 1, at nucleic acid positions 586, 752, 802, 842, 

20 1204, 1318, 1518, 1711, 2000, 2009, 2245, 2275, 2278, 2280, 
2281, 2288, 2305, 2383, 2384, 2518, 2540, 2548, 2848, 2850, 
2913, 3173, 3208, 3230, 3463, 3601, 3751, 4442. "N" may be 
any nucleic acid residue which provides an amino acid which 
results in a sulfonylurea receptor. For example, SEQ ID NOS: 

25 4 and 6 provide cDNAs of rat and hamster, respectively, where 
N of SEQ ID NO: 1 is identified as a particular nucleic acid 
such that a corresponding amino acid may be provided in the 
protein. In addition, an Asparagine is inserted in the 
hamster sequence at amino acid position 74 0 and a threonine 

3 0. is deleted in the hamster sequence at amino acid position 
831. The N positions indicated for SEQ ID NO: l result in 
differences in the rat and hamster sequences provided in SEQ 
ID NOS: 4 and 5 (rat) , 6 and 7 (hamster) at amino acid 
positions 196 - Val, lie; 251 - Gly, Ala; 268 - Leu, Val ; 281 

35 - Gin, Pro; 300 - Val, He; 402 - Leu, Met; 440 - Ala, Thr; 
491 - Tyr, His; 506 - He, Met; 516 - Asn, Ser; 524 - Lys, 
Val; 571 - Phe, Leu; 667 - Thr f Ala; 670 - Thr, Met; 749 - 
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Arg, 750 - Gly; 759 - Gly, 760 - Ser; 760 - Arg, 761 - Trp; 
761 - Phe, 762 - Leu; 763 - Cys, 764 - Tyr; 769 - Cys, 770 - 
Arg; 795 - Lys, 796 - Pro; (two series of numbers appear due 
to the asparagine deletion in the hamster sequence, the 
5 hamster codon is represented by the second number) 84 0 - Arg, 
Gly; 847 - Thr, Arg; 850 - lie, Phe; 950 - Pro, Ser; 971 - 
Asp, Glu; 1058 - Ala, Asp; 1070 - Val, Leu; 1077 - Ala, Val; 
1155 - Ala, Thr; 1201 - Leu, Val; 1251 - Arg, Cys; and 1481 - 
Lys, Thr. In addition, the hamster sequence has an Asn 

10 insert at position 741 and a Thr deletion at position 831, 
A purified sulfonylurea receptor is also provided 
by the present invention. The purified sulfonylurea receptor 
may have an amino acid sequence as provided by SEQ ID NOS: 4, 
5, 6, 7, 34, and 37. A purified sulfonylurea receptor having 

15 the amino acid sequence of SEQ ID NO: 28 is also within the 
scope of the present invention. 

The present invention is directed to sulfonylurea 
receptor sequences obtained from mammals from the Order 
Rodentia, including and not limited to hamsters, rats, and 

2 0 mice; Order Logomorpha, such as rabbits; more particularly 
the Order Carnivora, including Felines (cats) and Canines 
(dogs) ; even more particularly the Order Artiodactyla, 
Bovines (cows) and Suines (pigs) ; and the Order 
Perissodactyla, including Equines (horses) ; and most 

25 particularly the Order Primates, Ceboids and Simoids 

(monkeys) and Anthropoids (humans and apes) . The mammals of 
most preferred embodiments are humans. 

There are several transfection techniques by which 
a sulfonylurea receptor may be obtained. An appropriate RNA 

.30 may be hybridized to a cDNA to obtain a sulfonylurea receptor 
nucleic acid sequence. A nucleic acid sequence encoding 
sulfonylurea receptor may be inserted into cells and the 
corresponding protein immunoprecipitated with an antibody. 
Labeled drugs known to bind sulfonylurea receptor protein may 

35 be added to cell culture to label the receptor. The drug 
labeling procedure may involve modifying cells such that the 
cell culture provides conditions similar to (3 cells, cells 
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where sulfonylurea receptors naturally appear; and the 
sulfonylurea receptor may be part of a larger multisubunit 
ATP receptor channel, which may not be provided by the cells 
in culture. 

5 Generally, the sequences of the invention may be 

produced in host cells transformed with an expression vector 
comprising a nucleic acid sequence encoding the sulfonylurea 
receptor. The transformed cells are cultured under 
conditions whereby the nucleic acid sequence coding for the 

10 sulfonylurea receptor is expressed. After a suitable amount 
of time for the protein to accumulate, the protein is 
purified from the transformed cells, 

A gene coding for sulfonylurea receptor may be 
obtained from a cDNA library. Suitable libraries can be 

15 obtained from commercial sources such as Clontech, Palo Alto, 
CA. Libraries may also be prepared using the following non- 
limiting examples hamster insulin-secreting tumor (KIT) , 
mouse aT06, and rat insulinoma (RIN) cells* Positive clones 
are then subjected to DNA sequencing to determine the 

20 presence of a DNA sequence coding for sulfonylurea receptor. 
DNA sequencing is accomplished using the chain termination 
method of Sanger et al . , Proc . Nat'l. Acad. Sci, U.S.A., 
1977, 74, 5463. The DNA sequence encoding sulfonylurea 
receptor is then inserted into an expression vector for later 

25 expression in a host cell. 

Expression vectors and host cells are selected to 
form an expression system capable of synthesizing 
sulfonylurea receptor. Vectors including and not limited to 
baculovirus vectors may be used in the present invention, 

30 Host cells suitable for use in the invention include 

prokaryotic and eukaryotic cells that can be transformed to 
stably contain and express sulfonylurea receptor. For 
example, nucleic acid coding for the recombinant protein may 
be expressed in prokaryotic or eukaryotic host cells, 

35 including the most commonly used bacterial host cell for the 
production of recombinant proteins, E. coli. Other microbial 
strains may also be used, however, such as Bacillus subtllls , 
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and other enterobacteriaceae such as Salmonella typhlmurium 
or Serratla marcescens , various species of Pseudomonas , or 
other bacterial strains. 

Commonly used eukaryotic systems include yeast, 
5 such as Saccharomyces cerevislae; insect cells, such as 
Spodoptera frugxperda; chicken cells, such as E3C/0 and SL- 
29; mammalian cells, such as HeLa, Chinese hamster ovary 
cells (CHO) , COS-7 or MDCK cells and the like. The foregoing 
list is illustrative only and is not intended in any way to 

10 limit the types of host cells suitable for expression of the 
nucleic acid sequences of the invention. 

As used herein, expression vectors refer to any 
type of vector that can be manipulated to contain a nucleic 
acid sequence coding for sulfonylurea receptor, such as 

15 plasmid expression vectors and viral vectors. The selection 
of the expression vector is based on compatibility with the 
desired host cell such that expression of the nucleic acid 
encoding sulfonylurea receptor results. Plasmid expression 
vectors comprise a nucleic acid sequence of the invention 

2 0 operably linked with at least one expression control element 
such as a promoter. In general, plasmid vectors contain 
replicon and control sequences derived from species 
compatible with the host cell. To facilitate selection of 
plasmids containing nucleic acid sequences of the invention, 

2 5 plasmid vectors may also contain a selectable marker such as 

a gene coding for antibiotic resistance. Suitable examples 
include the genes coding for ampicillin, tetracycline, 
chloramphenicol or kanamycin resistance. 

Suitable expression vectors, promoters, enhancers, 

3 0 and other expression control elements are known in the art 

and may be found in Sambrook et al., Molecular Cloning: A 
Laboratory Manual, second edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY (1989), incorporated 
herein by reference in its entirety. 
35 Transformed host cells containing a DNA sequence 

encoding sulfonylurea receptor may then be grown in an 
appropriate medium for the host. The cells are then grown 
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until product accumulation reaches desired levels at which 
time the cells are then harvested and the protein product 
purified in accordance with conventional techniques. 
Suitable purification methods include, but are not limited 
5 to, SDS PAGE electrophoresis, phenylboronate-agarose, 
reactive green 19-agarose, concanavalin A sepharose, ion 
exchange chromatography, affinity chromatography, 
electrophoresis, dialysis and other methods of purification 
known in the art. 

10 Protein preparations, of purified or unpurified 

sulfonylurea receptor produced by host cells, are accordingly 
produced which comprise sulfonylurea receptor and other 
material such as host cell components and/or cell medium, 
depending on the degree of purification of the protein. 

15 Antibodies, including and not limited to 

monoclonal, polyclonal, and chimeric, prepared and used 
against a sulfonylurea receptor are also within the scope of 
the present invention, and may be prepared by methods known 
to those of skill in the art such as and not limited to the 

20 methods of Kohler and Milstein, Nature, 256: 495-497 (1975), 
incorporated herein by reference in its entirety. 

The invention also includes a transgenic non-human 
animal, including and not limited to mammals, such as and not 
limited to a mouse, rat, or hamster, whose germ cells and 

25 somatic cells contain a sequence encoding a sulfonylurea 
receptor introduced into the animal or an ancestor of the 
animal. The sequence may be wild-type or mutant and may be 
introduced into the animal at the embryonic or adult stage. 
The sequence is incorporated into the genome of an animal 

3 0 such that it is chromosomally incorporated into an activated 
state. Embryo cells may be transfected with the gene as it 
occurs naturally, and transgenic animals are selected in 
which the gene has integrated into the chromosome at a locus 
which results in activation. Other activation methods 

35 include modifying the gene or its control sequences prior to 
introduction into the embryo. The embryo may be transfected 
using a vector containing the gene. 
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In addition, a transgenic non-human animal may be 
engineered wherein the sulfonylurea receptor is suppressed. 
For purposes of the present invention, suppression of the 
sulfonylurea receptor includes, and is not limited to 
5 strategies which cause the sulfonylurea receptor not to be 
expressed. Such strategies may include and are not limited 
to inhibition of protein synthesis, pre-mRNA processing, or 
DNA replication. Each of the above strategies may be 
accomplished by antisense inhibition of sulfonylurea receptor 

10 gene expression. Many techniques for transfer ing antisense 
sequences into cells are known to those of skill, including 
and not limited to microinjection, viral-mediated transfer, 
somatic cell transformation, transgene integration, and the 
like, as set forth in Pinkert, Carl, Transgenic Animal 

15 Technology, 1994, Academic Press, Inc., San Diego, CA, 
incorporated herein by reference in its entirety. 

Further, a transgenic non-human animal may be 
prepared such that the sulfonylurea receptor gene is knocked 
out. For purposes of the present invention, a knock out 

2 0 includes and is not limited to disruption or rendering null 

the sulfonylurea receptor gene. A knock out may be 
accomplished, for example, with antisense sequences for the 
sulfonylurea receptor mutating the sequence for the 
sulfonylurea receptor. The sulfonylurea receptor gene may be 
25 knocked out by injection of an antisense sequence for all or 
part of the sulfonylurea receptor sequence such as an 
antisense sequence for all or part of SEQ ID NO: 27. Once 
the sulfonylurea receptor has been rendered null, correlation 
of the sulfonylurea receptor to persistent hyperinsulinemic 

3 0 hypoglycemia of infancy may be tested. Sequences encoding 

mutations affecting the sulfonylurea receptor may be inserted 
to test alterations in glucose homeostasis. 

Also in transgenic non-human animals, the 
sulfonylurea receptor may be replaced by preparing a 
35 construct having an insulin promoter ligated to the 

sulfonylurea receptor gene. This experiment permits testing 
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of mutant sulfonylurea receptors directly in the pancreas of 
the transgenic animal. 

Transgenic non-human animals may also be useful for 
testing nucleic acid changes to identify nucleotides which 
- 5 are responsible for ADP and ATP modulation of the 

sulfonylurea receptor resulting in an increase or decrease in 
glucose sensitivity of insulin release. 

The present invention is also directed to gene 
therapy wherein a mutant sulfonylurea receptor is replaced by 

10 a wild type sulonylurea receptor. A resulting transgenic 
non-human animal thus comprises a recombinant sulfonylurea 
receptor. In addition, gene therapy techniques may be used 
for individuals with persistent hyperinsulinemic hypoglycemia 
of infancy. For purposes of the present invention, gene 

15 therapy refers to the transfer and stable insertion of new 
genetic information into cells for the therapeutic treatment 
of diseases or disorders. The foreign gene is transferred 
into a cell that proliferates to spread the new gene 
throughout the cell population. Known methods of gene 

2 0 transfer include microinjection, electroporation, liposomes, 

chromosome transfer, transfection techniques, 
calcium-precipitation transfection techniques, and the like. 

Numerous techniques are known in the art for the 
introduction of foreign genes into cells and may be used to 
25 construct the recombinant cells for purposes of gene therapy, 
in accordance with this embodiment of the invention. The 
technique used should provide for the stable transfer of the 
heterologous gene sequence to the stem cell, so that the 
heterologous gene sequence is heritable and expressible by 

3 0 stem cell progeny, and so that the necessary development and 

physiological functions of the recipient cells are not 
disrupted. Techniques which may be used include but are not 
limited to chromosome transfer (e.g., cell fusion, 
chromosome-mediated gene transfer, micro cell-mediated gene 
35 transfer), physical methods (e.g., transfection, spheroplast 
fusion, microinjection, electroporation, liposome carrier) , 
viral vector transfer (e.g., recombinant DNA viruses, 
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10 



recombinant RNA viruses) and the like (described in Cline, M. 
J., 1985, Pharmac. Ther. 29:69-92, incorporated herein by' 
reference in its entirety) . 

The term "purified", when used to describe the 
state of nucleic acid sequences of the invention, refers to 
nucleic acid sequences substantially free of nucleic acid not 
coding for sulfonylurea receptor or other materials normally 
associated with nucleic acid in non-recombinant cells, i.e., 
in its "native state." 

The term "purified" or "in purified form" when used 
to describe the state of a sulfonylurea receptor, protein, 
polypeptide, or amino acid sequence, refers to sulfonylurea 
receptor sequences free, to at least some degree, of cellular 
material or other material normally associated with it in its 
15 native state. Preferably the sequence has a purity 

(homogeneity) of at least about 25% to about 100%. More 
preferably the purity is at least about 50%. 

To begin to elucidate the relationship between the 
sulfonylurea receptor and K ATP , the iodinated derivative of 
2 0 glyburide was used to identify, and subsequently to purify 
and obtain N-terminal amino acid sequence from the 140 kDa 
high affinity, hamster insulin-secreting tumor (HIT) cell 
sulfonylurea receptor. The peptide sequence data was used to 
clone full length cDNAs encoding the rat and hamster 0-cell 
25 proteins of the present invention. 

Another embodiment of the present invention is a 
method of detecting persistent hyperinsulinemic hypoglycemia 
of infancy comprising obtaining a sample comprising patient 
nucleic acids from a patient tissue sample; amplifying 
sulfonylurea receptor specific nucleic acids from said 
patient nucleic acids to produce a test fragment; obtaining a 
sample comprising control nucleic acids from a control tissue 
sample; amplifying control nucleic acids encoding wild type 
sulfonylurea receptor to produce a control fragment; 
comparing the test fragment with the control fragment to 
detect the presence of a sequence difference in the test 
fragment, wherein a difference in said test fragment 



.30 
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indicates persistent, hyperinsulinemic hypoglycemia of infancy 
is also an embodiment of the present invention. 

Persistent hyperinsulinemic hypoglycemia of infancy 
(PHHI) is an autosomal recessive disorder which results in 
5 unregulated insulin secretion. The present invention 
revealed several different mutations in the sulfonylurea 
receptor in individuals with PHHI. These mutations include 
nucleic acid transition and restriction fragment length 
polymorphism, both defined herein as sequence differences. 

10 The nucleic acid sequence transition may be a G to A 

transition at nucleic acid position 750 in SEQ ID NO: 26 
which results in PHHI. This transition was found to occur in 
nine affected children in nine different families of the 
families studied. The pancreatic cDNA from a child with this 

15 transition involved skipping an exon. Exon X of Figure 10 
was skipped resulting in an mRNA transcript having a 109 bp 
deletion, a frame shift, and the inclusion of a premature 
stop codon. This deletion may be seen by performing rtPCR on 
the child's mRNA. Amplification of SEQ ID NO: 2 6 resulted in 

20 a 427 base pair product for the normal as well as for the 

mutant genomic DNA. Digesting the normal and mutant products 
with MspJ, however, resulted in three fragments (3 04 bp, 85 
bp, and 38 bp) for the normal gene and two fragments (304 bp 
and 123 bp) for the mutant gene of affected children. 

2 5 Another mutation involves a G to A transition in 

intron 11 of the human sulfonylurea receptor which gives rise 
to PHHI • The transition site corresponds to position 27 of 
SEQ ID NO: 29. The G to A transition destroys a restriction 
site for Neil. Both normal and mutant PGR products resulted 
30 in 146 bp. Digestion with Neil resulted in two fragments (71 
bp and 75 bp) fragments for normal individuals, while the 
mutant sequence was not be cut by Neil and thus remained at 
146 bp. 

A method of detecting persistent hyperinsulinemic 

3 5 hypoglycemia of infancy comprising obtaining a sample 

comprising patient genomic DNA from a patient tissue sample; 



BNSDOC1D: <WO__952841 1A1J_> 



WO 95/28411 




PCT/US95/04463 



- 23 - 

amplifying sulfonylurea receptor specific DNA from said 
patient, genomic DNA to produce a test fragment; obtaining a 
sample comprising control nucleic acids from a control tissue 
sample; amplifying control DNA encoding wild type 
5 sulfonylurea receptor to produce a control fragment; 

comparing the test fragment with the control fragment to 
detect a test fragment having G to A transition at nucleic 
acid position 750 of SEQ ID NO: 26, or a G to A transition at 
nucleic acid position 27 of SEQ ID NO: 29, wherein said test 

10 fragment indicates persistent hyperinsulinemic hypoglycemia 
of infancy is also an embodiment of the present invention. 

Also within the scope of the present invention is a 
method of detecting persistent hyperinsulinemic hypoglycemia 
of infancy comprising obtaining a sample comprising patient 

15 mRNA from a patient tissue sample; reverse transcribing said 
mRNA into cDNA to produce patient cDNA; amplifying 
sulfonylurea receptor specific cDNA from said patient cDNA to 
produce a test fragment; obtaining a sample comprising 
control nucleic acids froma control tissue sample; amplifying 

2 0 control DNA encoding wild type sulfonylurea receptor to 

produce a control fragment; digesting said test fragment and 
said control fragment with an endonuclease selected from the 
group consisting of Neil and Mspl; and comparing the test 
fragment with the control fragment to detect a restriction 

25 fragment length polymorphism, wherein said restriction 
fragment length polymorphism indicates persistent 
hyperinsulinemic hypoglycemia of infancy. 

The restriction fragment polymorphisms include test 
fragments of about 304 bp and about 12 3 bp as a result of 

30 Mspl restriction and a test fragment of about 146 bp as a 

result of Neil restriction. The test fragments thus indicate 
persistent hyperinsulinemic hypoglycemia of infancy. 

In accordance with methods of the present 
invention, methods of detecting PHHI in a patient are 

35 provided comprising obtaining a patient tissue sample for 
testing. The tissue sample may be solid or liquid, a body 
fluid sample such as and not limited to blood, serum, saliva, 
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sputum, mucus, bone marrow, urine, lymph, and a tear; and 
feces. m addition, a tissue sample such as pancreatic 
tissue may be provided for the detection of PHHI in 
accordance with the present invention. 
5 A test fragment is defined herein as an amplified 

sample comprising sulfonylurea receptor specific nucleic 
acids from a patient suspected of having PHHI. A control 
fragment is an amplified sample comprising normal or wild 
type sulfonylurea receptor specific nucleic acids from an 
10 individual not suspected of having PHHI . 

The method of amplifying nucleic acids may be the 
polymerase chain reaction using a pair of primers wherein at 
least one primer within the pair is selected from the group 
consisting of SEQ ID NO: 16-24. When the polymerase chain 
reaction is the amplification method of choice, a pair of 
primers may be used such that one primer of the pair is 
selected from the group consisting of SEQ ID NOS: 17, is, 21, 
and 23 and the second primer of the pair is selected' from the 
group consisting of SEQ ID NOS: 16, 19, 20, 22, and 24. 

Nucleic acids, such as DNA (such as and not limited 
to genomic DNA and cDNA) and/or RNA (such as and not limited 
to mRNA) , are obtained from the patient sample. Preferably 
RNA is obtained. A whole blood gradient may be performed to 
isolate nucleated cells and total RNA is extracted such as by 
the RNazole B method (Tel-Test Inc., Friendswood, Texas) or 
by modification of any methods known in the art such as 
described in Sambrook et al . , Molecular Cloning: A Laboratory 
Manual, 1989, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, incorporated herein by reference in its entirety. 

Nucleic acid extraction is followed by 
amplification of the same by any technique known in the art. 
The amplification step includes the use of at least one 
primer sequence which is complementary to a portion of 
sulfonylurea receptor specific expressed nucleic acids or 
35 sequences. Primer sequences useful in the amplification 
methods include and are not limited to SEQ ID NOS: 16-24, 
which may be used in the amplification methods. Any primer 
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sequence of about 10 nucleotides to about 35 nucleotides, 
more preferably about 15 nucleotides to about 3 0 nucleotides, 
even more preferably about 17 nucleotides to about 25 
nucleotides may be useful in the amplification step of the 
5 methods of the present invention. In addition, mismatches 
within the sequences identified above, which achieve the 
methods of the invention, such that the mismatched sequences 
are substantially complementary and thus hybridizable to the 
sequence sought to be identified, are also considered within 

10 the scope of the disclosure. Mismatches which permit 

substantial similarity to SEQ ID NOS: 16-24, such as and not 
limited to sequences with similar hydrophobicity and 
hydrophilicity , will be known to those of skill in the art 
once armed with the present disclosure. The primers may also 

15 be unmodified or modified. Primers may be prepared by any 
method known in the art such as by standard phosphoramidite 
chemistry. See Sambrook et al., supra. 

The method of amplifying nucleic acids may be the 
polymerase chain reaction using a pair of primers wherein at 

20 least one primer within the pair is selected from the group 
consisting of SEQ ID NO: 16-24. When the polymerase chain 
reaction is the amplification method of choice, a pair of 
primers may be used such that one primer of the pair is 
selected from the group consisting of SEQ ID NOS: 16-24. 

25 Primers used in mutational analysis were SEQ ID NO: 

16: CACGCTCAGGTTCTGGAT; SEQ ID NO: 17: TCAACTGGATGGTGAGGA ; 
SEQ ID NO: 18: 5' TGACATCGCCAAACTGC ; SEQ ID NO: 19: 
TCCTGGCAGTGCCTTCA; SEQ ID NO: 20: TCCTCTCAGGGTCCAGGTTA; SEQ 
ID NO: 21: ACAAGGAGCCTGGGGAT ; SEQ ID NO: 22: 

3 0 TGCATGGGTCCCAGTGA; SEQ ID NO: 23: TTGACCATTCACCACATTGGTGTGC ; 
and SEQ ID NO: 24: ACCATCGACCAGCACATC 

When an amplification method includes the use of 
two primers, a first primer and a second primer, such as in 
the polymerase chain reaction, the first primer may be 

3 5 selected from the group consisting of SEQUENCE ID NOS: 17, 
18, 21, and 23; and the second primer may be selected from 
the group consisting of SEQUENCE ID NOS: 16, 19, 20, 22, and 
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24. Any primer pairs which transcribe nucleic acids toward 
each other and which are specific for sulfonylurea receptor 
may be used in accordance with the methods of the present 
invention. 

5 Total extraction of RNA is preferably carried out. 

As used herein, the term "amplification 1 * refers to 
template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a 
specific nucleic acid molecule relative to its initial 
10 concentration, or in an increase in the concentration of a 
detectable signal. As used herein, the term 

template-dependent process is intended to refer to a process 
that involves the template-dependent extension of a primer 
molecule. The term template dependent process refers to 

15 nucleic acid synthesis of an RNA or DNA molecule wherein the 
sequence of the newly synthesized strand of nucleic acid is 
dictated by the well-known rules of complementary base 
pairing (see, for example, Watson, J. D * et al., In: 
Molecular Biology of the Gene, 4th Ed., W* A* Benjamin, Inc., 

20 Menlo Park, Calif. (1987)). Typically, vector mediated 

methodologies involve the introduction of the nucleic acid 
fragment into a DNA or RNA vector, the clonal amplification 
of the vector, and the recovery of the amplified nucleic acid 
fragment. Examples of such methodologies are provided by 

25 Cohen et al . (U.S. Pat. No. 4,237,224), Maniatis, T. et al. , 
Molecular Cloning (A Laboratory Manual), Cold Spring Harbor 
Laboratory, 1982. 

A number of template dependent processes are 
available to amplify the target sequences of interest present 

3 0 in a sample. One of the best known amplification methods is 
the polymerase chain reaction (PCR) which is described in 
detail in U.S. Patents 4,683,195, 4,683,202 and 4,800,159, 
and in Innis et al . , PCR Protocols, Academic Press, Inc., San 
Diego CA, 1990, each of which is incorporated herein by 

35 reference in its entirety. Briefly, in PCR, two primer 

sequences are prepared which are complementary to regions on 
opposite complementary strands of the target sequence. An 
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excess of deoxynucleoside triphosphates are added to a 
reaction mixture along with a DNA polymerase (e.g., Taq 
polymerase) . If the target sequence is present in a sample, 
the primers will bind to the target and the polymerase will 
5 cause the primers to be extended along the target sequence by 
adding on nucleotides. By raising and lowering the 
temperature of the reaction mixture, the extended primers 
will dissociate from the target to form reaction products, 
excess primers will bind to the target and to the reaction 

10 products and the process is repeated. Preferably a reverse 
transcriptase PCR amplification procedure may be performed in 
order to quantify the amount of mRNA amplified. Polymerase 
chain reaction methodologies are well known in the art. 
Another method for amplification is the ligase 

15 chain reaction (referred to as LCR) , disclosed in EPA No, 
320,308, incorporated herein by reference in its entirety. 
In LCR, two complementary probe pairs are prepared, and in 
the presence of the target sequence, each pair will bind to 
opposite complementary strands of the target such that they 

2 0 abut. In the presence of a ligase, the two probe pairs will 
link to form a single unit. By temperature cycling, as in 
PCR, bound ligated units dissociate from the target and then 
serve as "target sequences" for ligation of excess probe 
pairs. U.S. Patent 4,883,750, incorporated herein by 

25 reference in its entirety, describes an alternative method of 
amplification similar to LCR for binding probe pairs to a 
target sequence . 

Qbeta Replicase, described in PCT Application No. 
PCT/US87/00880, incorporated herein by reference in its 

30 entirety, may also be used as still another amplification 
method in the present invention. In this method, a 
replicative sequence of RNA which has a region complementary 
to that of a target is added to a sample in the presence of 
an RNA polymerase. The polymerase will copy the replicative 

35 sequence which can then be detected. 

An isothermal amplification method, in which 
restriction endonucleases and ligases are used to achieve the 
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amplification of target molecules that contain nucleotide 
5'-[ alpha -thio] triphosphates in one strand of a restriction 
site (Walker, G. T, , et al ♦ , Proc. Natl. Acad, Sci . (U.S.A. ) 
1992, 89: 392-396, incorporated herein by reference in its 
5 entirety) , may also be useful in the amplification of nucleic 
acids in the present invention. 

Strand Displacement Amplification (SDA) is another 
method of carrying out isothermal amplification of nucleic 
acids which involves multiple rounds of strand displacement 

10 and synthesis, i«e. nick translation. A similar method, 
called Repair Chain Reaction (RCR) is another method of 
amplification which may be useful in the present invention 
and which involves annealing several probes throughout a 
region targeted for amplification, followed by a repair 

15 reaction in which only two of the four bases are present. 

The other two bases can be added as biotinylated derivatives 
for easy detection. A similar approach is used in SDA. 

Sulfonylurea receptor specific nucleic acids can 
also be detected using a cyclic probe reaction (CPR) . In 

2 0 CPR, a probe having a 3' and 5' sequences of non-sulf onylurea 
receptor specific DNA and middle sequence of sulfonylurea 
receptor specific RNA is hybridized to DNA which is present 
in a sample. Upon hybridization, the reaction is treated 
with RNaseH, and the products of the probe identified as 

25 distinctive products, generate a signal which is released 
after digestion. The original template is annealed to 
another cycling probe and the reaction is repeated. Thus, 
CPR involves amplifying a signal generated by hybridization 
of a probe to a sulfonylurea receptor specific expressed 

30 nucleic acid. 

Still other amplification methods described in GB 
Application No. 2 202 328, and in PCT Application No. 
PCT/US89 / 01025 , each of which is incorporated by reference in 
its entirety, may be used in accordance with the present 

35 invention. In the former application, "modified" primers are 
used in a PGR like, template and enzyme dependent synthesis. 
The primers may be modified by labelling with a capture 
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moiety (e»g., biotin) and/or a detector moiety (e.g., 
enzyme) . In the latter application, an excess of labelled 
probes are added to a sample. In the presence of the target 
sequence, the probe binds and is cleaved catalytically . 
5 After cleavage, the target sequence is released intact to be 
bound by excess probe. Cleavage of the labelled probe 
signals the presence of the target sequence. 

Other nucleic acid amplification procedures include 
transcription-based amplification systems (TAS) (Kwoh D. , et 

10 al., Proc. Natl. Acad. Scl. (U.S.A.) 1989, 85:1173, Gingeras 
T. R. , et al . , PCT Application WO 88/10315, each of which is 
incorporated herein by reference in its entirety) , including 
nucleic acid sequence based amplification (NASBA) and 3SR. 
In NASBA, the nucleic acids can be prepared for amplification 

15 by standard phenol /chloroform extraction, heat denaturation 
of a clinical sample, treatment with lysis buffer and 
minispin columns for isolation of DNA and RNA or guanidinium 
chloride extraction of RNA. These amplification techniques 
involve annealing a primer which has sulfonylurea receptor 

2 0 specific sequences. Following polymerization, DNA /RNA 

hybrids are digested with RNase H while double stranded DNA 
molecules are heat denatured again. In either case the 
single stranded DNA is made fully double stranded by addition 
of second sulfonylurea receptor specific primer, followed by 

2 5 polymerization. The double stranded DNA molecules are then 

multiply transcribed by a polymerase such as T7 or SP6. In 
an isothermal cyclic reaction, the RNAs are reverse 
transcribed into double stranded DNA, and transcribed once 
against with a polymerase such as T7 or SP6 . The resulting 

3 0 products, whether truncated or complete, indicate 

sulfonylurea receptor specific sequences. 

Davey, C. , et al . , European Patent Application 
Publication No. 329,822, incorporated herein by reference in 
its entirety, disclose a nucleic acid amplification process 
35 involving cyclically synthesizing single-stranded RNA 

("ssRNA" ) , ssDNA, and double-stranded DNA ("dsDNA") which may 
be used in accordance with the present invention. The ssRNA 
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is a first template for a first primer oligonucleotide, which 
is elongated by reverse transcriptase (RNA-dependent DNA 
polymerase) . The RNA is then removed from resulting DNA : RNA 
duplex by the action of ribonuclease H (RNase H, an RNase 
5 specific for RNA in a duplex with either DNA or RNA) . The 
resultant ssDNA is a second template for a second primer, 
which also includes the sequences of an RNA polymerase 
promoter (exemplified by T7 RNA polymerase) 5' to its 
homology to its template ♦ This primer is then extended by 

10 DNA polymerase (exemplified by the large "Klenow** fragment of 
E. coli DNA polymerase I) , resulting as a double-stranded DNA 
("dsDNA" ) molecule, having a sequence identical to that of 
the original RNA between the primers and having additionally, 
at one end, a promoter sequence. This promoter sequence can 

15 be used by the appropriate RNA polymerase to make many RNA 
copies of the DNA* These copies can then re-enter the cycle 
leading to very swift amplification* With proper choice of 
enzymes, this amplification can be done isothermally without 
addition of enzymes at each cycle. Because of the cyclical 

20 nature of this process, the starting sequence can be chosen 
to be in the form of either DNA or RNA. 

Miller, H. I., et al . , PCT application WO 89/06700, 
incorporated herein by reference in its entirety, disclose a 
nucleic acid sequence amplification scheme based on the 

25 hybridization of a promoter /primer sequence to a target 

single-stranded DNA ("ssDNA" ) followed by transcription of 
many RNA copies of the sequence. This scheme is not cyclic; 
i.e. new templates are not produced from the resultant RNA 
transcripts. Other amplification methods include "race" 

30 disclosed by Frohman, M. A., In: PCR Protocols: A Guide to 
Methods and Applications 1990, Academic Press, N.Y.) and 
"one-sided PCR M (Ohara, 0., et al . , Proc . Natl. Acad. Sci . 
(U.S.A.) 1989, 56:5673-5677), all references herein 
incorporated by reference in their entirety. 

3 5 Methods based on ligation of two (or more) 

oligonucleotides in the presence of nucleic acid having the 
sequence of the resulting "di-oligonucleotide" , thereby 
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amplifying the di-oligonucleotide (Wu, D . Y. et al . , Genomics 
1989, 4:560, incorporated herein by reference in its 
entirety) , may also be used in the amplification step of the 
present invention. 
5 Test fragment and control fragment may be amplified 

by any amplification methods known to those of skill in the 
art, including and not limited to the amplification methods 
set forth above. For purposes of the present invention, 
amplification of sequences encoding patient and wild type 

10 sulfonylurea receptor includes amplification of a portion of 
a sequence such as and not limited to a portion of the 
sulfonylurea receptor sequence of SEQ ID NO: 26, such as 
sequence of a length of about 10 nucleotides to about 1,000 
nucleotides, more preferably about 10 nucleotides to about 

15 100 nucleotides, or having at least 10 nucleotides occurring 
anywhere within the SEQ ID NO: 26, where sequence differences 
are known to occur within sulfonylurea receptor test 
fragments. Thus, for example, a portion of the sequence 
encoding the second nucleotide binding fragment (NBF-2) 

20 region of sulfonylurea receptor of a patient sample and a 
control sample may be amplified to detect sequence 
differences between these two sequences. 

Following amplification of the test fragment and 
control fragment, comparison between the amplification 

25 products of the test fragment and control fragment is carried 
out- Sequence differences such as and not limited to nucleic 
acid transition and restriction digest pattern alterations 
may be detected by comparison of the test fragment with the 
control fragment. Nucleic acid transition includes and is 

3 0 not limited to a G to A transition at nucleic acid position 
750 of SEQ ID NO: 26. Another nucleic acid transition 
involves a G to A transition at nucleic acid position 27 of 
SEQ ID NO: 29. 

These nucleic acid transitions lead to restriction 

3 5 fragment length polymorphisms as exemplified by the altered 
results following Hspl and Neil restriction digests set forth 
above. Accordingly, the restriction fragment length 
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polymorphisms of test fragments may be compared to the 
restriction fragments of control fragments. 

Alternatively, the presence or absence of the 
amplification product may be detected. The nucleic acids are 
5 fragmented into varying sizes of discrete fragments. For 
example, DNA fragments may be separated according to 
molecular weight by methods such as and not limited to 
electrophoresis through an agarose gel matrix. The gels are 
then analyzed by Southern hybridization. Briefly, DNA in the 

10 gel is transferred to a hybridization substrate or matrix 

such as and not limited to a nitrocellulose sheet and a nylon 
membrane. A labelled probe encoding a sulfonylurea mutation 
is applied to the matrix under selected hybridization 
conditions so as to hybridize with complementary DNA 

15 localized on the matrix. The probe may be of a length 

capable of forming a stable duplex. The probe may have a 
size range of about 2 00 to about 10,000 nucleotides in 
length, preferably about 500 nucleotides in length, and more 
preferably about 2,454 nucleotides in length. The preferred 

2 0 sequence of the probe is set forth in SEQ ID NO: 30. 

Mismatches which permit substantial similarity to SEQ ID NO r 
30, such as and not limited to sequences with similar 
hydrophobicity and hydrophilicity , will be known to those of 
skill in the art once armed with the present disclosure. 

25 Various labels for visualization or detection are known to 
those of skill in the art, such as and not limited to 
fluorescent staining, ethidium bromide staining for example, 
avidin/biotin, radioactive labeling such as 32 P labeling, and 
the like. Preferably, the product, such as the PGR product, 

30 may be run on an agarose gel and visualized using a stain 
such as ethidium bromide. See Sambrook et al . , supra. The 
matrix may then be analyzed by autoradiography to locate 
particular fragments which hybridize to the probe. Yet 
another alternative is the sequencing of the test fragment 

35 and the control fragment to identify sequence differences* 
Methods of nucleic acid sequencing are known to those of 
skill in the art, including and not limited to the methods of 
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Maxam and Gilbert , Proc . Natl. Acad. Sci., USA 1977, 74, 560- 
564 and Sanger, Proc. Natl. Acad. Sci . , USA 1977, 74, 54 63- 
5467. 

A diagnostic kit for detecting PHHI comprising in 
5 one or more containers at least one primer which is 

complementary to a sulfonylurea receptor sequence and a means 
for visualizing amplified DNA is also within the scope of the 
present invention. Alternatively, the kit may comprise two 
primers. In either case, the primers may be selected from 

10 the group consisting of SEQ ID NOS: 16-24, for example. The 
diagnostic kit may comprise a pair of primers wherein one 
primer within said pair is complementary to a region of the 
sulfonylurea receptor gene, wherein one of said pair of 
primers is selected from the group consisting of SEQ ID NO: 

15 16-24, a probe specific to the amplified product, and a means 
for visualizing amplified DNA, and optionally including one 
or more size markers, and positive and negative controls. 
The diagnostic kit of the present invention may comprise one 
or more of a fluorescent dye such as ethidium bromide stain, 

2 0 32 P, and biotin, as a means for visualizing or detecting 

amplified DNA. Optionally the kit may include one or more 
size markers, positive and negative controls, restriction 
enzymes such as and not limited to Mspl and/or Neil f and/or a 
probe specific to the amplified product* 
25 The following examples are illustrative but are not 

meant to be limiting of the invention. 

EXAMPLES 

Purification and Partial Characterization of the 14 0 kDa 
Receptor: 

3 0 HIT cell membranes were photolabeled using a 

radioiodinated derivative of the second generation 
hypoglycemic drug, glyburide, according to the methods of 
Nelson, D.A. , et al . , JBC , 1992, 267:14928, Aguilar-Bryan , 
L., et al., JBC, 1992, 267:14934, and Aguilar-Bryan, L. , et 
35 al . , JSC, 1990, 265:8218, the disclosures of which are hereby 
incorporated by reference in its entirety. 
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Glyburide (Kramer et al. FEBS Lett. 1988 229:355- 
3 59) and an iodinated derivative of glyburide (Aguilar-Bryan 
et al* j*. Biol. Chem. 1990 -265:8218-8224) are known to 
photolabel a 14 0 kDa polypeptide. The pharmacological 
5 characteristics of the photolabeling, a kD in the low 

nanomolar range, and appropriate rank order of displacement 
with other insulin-releasing sulfonylureas, are those 
expected from studies on glyburide- induced insulin release 
from islets (Panten et al. Biochem. Pharm. 1989 38:1217-1229) 

10 and /3-cell lines (Schmid-Antomarchi et al. J. Biol. Chem. 

1987 262:15840-15844) and inhibition of K ATP channel activity. 
Glyburide was purchased from Sigma (St. Louis, MO) and 
prepared in stock solutions of 10 mM in dimethyl sulfoxide. 
Radioligand stocks were prepared by diluting high pressure 

15 liquid chromatography-purif ied 5-[ 12S ] iodo-2-hydroxyglyburide 
in dimethyl sulfoxide. Specific activity (cpm/mol) was 
measured on radioligand diluted 1/1000 into lOmM Tris, 100 mM 
NaCl, 2 mM EDTA, pH 7.4, and the absorbance determined at 2.5 
nm intervals in a UV-VIS Gilford spectrophotometer. 

20 Dimethylsulf oxide was diluted 1/1000 into the same buffer, 
and the absorbance of the buffer without drug was subtracted 
at each wavelength to generate the final absorbance profile, 

HIT cells, passage 67-73, were seeded in roller 
bottles at 50 x 10 6 cells/bottle in 100 ml of Dulbecco's 

25 modified Eagle's medium plus 10% fetal bovine serum. Cells 
were fed with 2 00 ml of medium plus serum 4-5 times over a 
period of 2 weeks until the cells were confluent. After 
plating and each feeding, bottles were gassed with 5% C0 2 
prior to capping. 

30 The cells in confluent roller bottles were washed 

with phosphate-buffered saline (0.14 M NaCl, 3 mM KC1, 2 mM 
KH 2 P0 6 , 1 mM Na 2 HP0 4/ pH 6.8) and then incubated at room 
temperature with 25 ml of phosphate-buffered saline plus 2 mM 
EDTA until cells detached from the sides of the bottles. 

35 Cells were pelleted at 900 xg for 10 minutes at 4 °C. 

All steps were carried out at 0-4 °C. Cell pellets 
were resuspended in 5 mM Tris, 2 mM EDTA , 0.1 mM PMSF, pH 
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7*4, using approximately 5 ml of buffer for each roller 
bottle* Cells were placed on ice for 4 0 minutes to allow 
swelling and then homogenized with 10 strokes of a motorized 
glass-Teflon homogenizer (500 rpm) . The homogenate was 
5 centrifuged at 1000 xg for 10 minutes to remove nuclei and 
cellular debris, and the supernatant transferred to 3 0 ml of 
Beckman polycarbonate, screw-cap ultracentrif uge tubes, 
Supernatants were centrifuged at 100,000 xg for 60 minutes in 
a Beckman 60 Ti rotor. The pellets were resuspended in 

10 membrane storage buffer (10 mM Tris, 100 mM NaCl, 2 mM EDTA, 
20% glycerol, 0*1 mM PMSF, pH 7.4). 200 mg of membrane 
protein were typically obtained from 20 roller bottles. 

Membranes were stored at -80° c at 5 mg/ml protein 
in 10 mM Tris (pH 7.5), 0.1 M NaCl, 2 mM EDTA, 20% glycerol. 

15 To monitor receptor purification, an aliquot (5-20 ml) of the 
membranes was incubated with 1 nM [ 125 I]-iodo-2- 
hydroxyglyburide for 15 minutes and the sample photolabeled. 
Binding of 5- [ 125 I]-iodo-2-hydroxyglyburide (5-10 nM) to 
membranes was done for 3 0 minutes at 23 °C. Aliquots were 

20 pipetted onto parafilm and irradiated at 23 °C in a UV cross- 
linker (Fisher Scientific) . The energy settings for the UV 
cross-linker were factory calibrated at 254 nm. For cross- 
linking at 312 nm, a conversion factor was estimated by 
determining the time required for the UV cross- linker to 

25 deliver a specific amount of energy with each set of bulbs, 
and then multiplying by the ratio of these times. 

All subsequent steps were performed at room 
temperature in the presence of 0,1 mM PMSF, 0.1 mM 
phenanthroline and 0.1 mM iodoacetamide. 2 0% (w/v) digitonin 

.30 was freshly prepared by boiling in deionized water, then 
added to 200-400 mg thawed labeled membranes to a final 
concentration of 1%. Membranes were solubilized for 15 
minutes then sedimented for 1 hr at 100,000 xg. The 
supernatant was divided into 4 ml aliguots and each aliquot 

35 was chromatographed over a 1 ml Concanavalin A-Sepharose 

column equilibrated with 2 5 mM Tris-HCl, pH 7.5,0.1 M NaCl, 2 
mM EDTA, 1% digitonin. The solution was cycled through the 
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column "twice before washing the column with 8 ml of the 
equilibration buffer. Retained protein was eluted with 4 ml 
of the same buffer containing 0.5 M methyl a-D- 
mannopyranoside. The eluted protein was stored at -80 °c. 
5 Three Con A eluates were combined, then cycled twice over a 1 
ml column of reactive green 19-agarose equilibrated with 5 0 
mM HEPES (pH 8.5), 2 mM EDTA, 0.2% digitonin. The column was 
washed with 8 ml of the equilibration buffer followed by 8 ml 
of the same buffer containing 0.4 M NaCl. The retained 

10 protein was eluted with 4 ml of the equilibration buffer 

containing 1.5 M NaCl. The two pooled eluates were diluted 
1:1 with the HEPES equilibration buffer without NaCl and 
cycled twice over a 1 ml phenylboronate-10 agarose column. 
The column was washed with 8 ml of the HEPES buffer, followed 

15 by 2 ml of 0.1 M Tris-HCl, pH 7.5, 2 mM EDTA, 0.1% digitonin. 
Protein was eluted with 4 ml of 0.1 M Tris (pH 7.5), 2 mM 
EDTA, 0.1 % SDS. The protein was concentrated to 0.5 ml 
using a 100,000 MW cutoff Amicon filter, pretreated with 5% 
Tween-20, then loaded onto a single 5 cm wide lane of a 5.5% 

20 polyacrylamide SDS gel. After electrophoresis the gel was 

stained with Coomassie blue, destained, and the receptor band 
excised with a razor blade. The receptor was electroeluted 
into a 14,000 MW cutoff dialysis bag and concentrated by 
Amicon filtration. 

25 Table 1 summarizes the yields and fold-purification 

in the scheme developed for receptor purification. The 
amount of receptor, yields, and fold-purification reported 
after each step are based on the radioactivity, determined by 
7 counting, in the 14 0 kDa band after electrophoresis 

3 0 relative to the total protein loaded on a gel lane {as 
determined using the BioRad protein assay) . HIT cell 
membranes contain approximately 1.6 pmol of receptor per mg 
of membrane protein as determined by filtration binding 
(Aguilar-Bryan et al. J . Biol. Chem. 1990 265:8218-8224). 
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TABLE 1 

Purification of the High Affinity 140 kDa Sulfonylurea Receptor from HIT cells 



Step 


Total Volume 
mi 


Total Protei 
n 

mg 


Receptor 
pmol 


Receptor 
pmol/mg 


Purification 
-fold 


Yield 

% 


Crude 
Membranes 


90 


200 


320 


1.6 


1 


100 


Supernatant 


90 


150 


240 


1.6 


1 


75 


ConA- 
Sep ha rose 


48 


10.2 


80 


7.8 


4.9 


25 


Reactive 

Green 

19-agarose 


16 


1.8 


56 


31.1 


19.5 


IS 


Phenyl 

boronate 

agarose 


4 


0.56 


45 


80.4 


50.4 


14 


SDS-PAGE 
electroelute 


0.2 


-0.002 


8 


4000 


2507 


2.5 



For the autoradiogram depicted in Figure 1, 1-2 
of purified, radiolabeled receptor was made 1% in /3* 
octylglucoside and divided into 6 aliquots. Lane 1 contained 

2 0 receptor kept on ice. The receptor was incubated in the 

presence (lane 2) and absence (lane 3) of Endo F for 30 min 
at 37 °C. Aliquots of the samples for lanes 1-3 were further 
incubated with V8 protease (1 /ug/10 ^1) for 30 min at 37 °C, 
yielding two radiolabeled peptides of 66 and 49 kDa (lanes 4 

25 and 6) , both of which are N-glycosylated as indicated by the 
mobility shift after endo F treatment (lane 5), To obtain N- 
terminal sequence from the intact receptor, 2 ixg of protein 
was separated by electrophoresis on a single, 0.8 cm wide 
lane of a 5.5% gel. The receptor was transferred to ProBlot 

.30 (Applied Biosystems) in 10 mM CAPS (pH 11), 10% MeOH, the 
filter stained for 10-20 seconds with Coomassie blue, 
destained, the band excised and microsequenced. To prepare 
receptor fragments for microsequencing , 10 fig of purified 
receptor was cleaved with V8 , electrophoresed on a single 

35 lane and the fragments from the partial digest transferred to 
ProBlot. Fragments were prepared and sequenced multiple 
times as indicated in the figure. Gels used in the 



BNS0OC1D: <WO_95284l 1 A1 J_> 



WO 95/28411 



PCT/US95/04463 



- 38 - 

preparation of receptor and fragments for microsequencing 
were aged overnight, and the top tray buffer contained 0,1 mM 
thioglycolate . 

The purified receptor showed a small apparent 
5 molecular weight decrease (AM r ~3000) following treatment with 
Endoglycosidase F/N-glycosidase F (Endo F) and yielded two 
bands following limited cleavage with V8 protease (Figure 1) . 
Each of the major labeled proteolytic fragments, M r ~69 and 49 
kDa, shift mobility after digestion with Endo F. Identical 

10 N-terminal sequence, 15-2 5 residues, were recovered from each 
of the major labeled peptides. No residue was obtained at 
residue 9 when the glycosylated peptides were sequenced; an 
aspartic acid was identified at residue 9 (see Figure 3) in 
the deglycosylated receptor indicating this is an N- 

15 glycosylated asparagine. In addition, N-terminal sequences 
were recovered on two unlabeled V8 peptides and a third minor 
labeled peptide. The results indicate there is an N-linked 
glycosyl group at residue nine in the mature receptor, 
suggesting that the N terminus is extracellular, and that the 

20 sulfonylurea labeling site is within the first 50 kDa of the 
receptor . 

Two multiple antipeptide antibodies (MAPs) , 
directed against residues 1 through 8 and 10 through 20 
respectively, of Figure 3, both immunoprecipitate 

25 photolabeled 140 kDa receptors from HIT, mouse aT06 , and rat 
insulinoma (RIN) cells. MAPs were prepared by synthetic 
protein sequencing (Perkin Elmer-ABI, 430 A Peptide 
Synthesizer, Foster City, CA) to obtain antibodies to M-P-L- 
A-F-C-G-T, residues 1-8 of SEQ ID NOS: 5 and 7. This process 

30 was repeated for residues 10-20 of SEQ ID NOS: 5 and 7, N-H- 
S-A-A-Y-R-V-D-Q-G. A purified sulfonylurea receptor protein 
was immunoprecipitated from HIT cells using the MAPs prepared 
as set forth above. 

HIT cell membranes were incubated with 5-[ I]iodo- 

35 2-hydroxyglyburide, photolabeled, solubilized with 1% 
digitonin, centrifuged at 100,000 xg and the supernatant 
incubated with 1/10 volume of preimmune serum, immune serum, 
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immune serum + anti~MAP 10*2 0, or immune serum + irrelevant 
MAP peptide. 50 ill of protein A-Sepharose was added and the 
mixture was incubated for 2 hours at room temperature , the 
beads washed with buffer, heated in the presence of pH 9 
5 sample buffer, eletrophoresed on a 6% polyacrylamide SDS gel, 
and an autoradiogram prepared. 

The immunoprecipitation was competed using the 
immunizing peptide, but not the other MAPS (Figure 2) . The 
amino acid sequence is derived from the photolabeled protein 
10 and the N-terminal amino acid sequence is conserved between 
mouse, rat, and hamster* 

Isolation and Characterization of cDNA Clones: 

Degenerate PGR primers with flanking restriction 
sites were designed based on the sequence obtained from the 
15 labeled peptides. The primers used were as follows: 
primer 1 (SEQ ID NO: 10): 

5' GAGAGAAGCTT(T/C)TG(T/C)GG<T/C/G/A)GA(A/G)AA(T/C)CA-3' 

primer 2 (SEQ ID NO: 11): 

5' GAGAGAGAATTCC(T/C)TO(A/G)TC(T/C/0/A)AC(T/C/G/A)C(G/T) (A/G)TA-3 ' 

20 The bases in parenthesis indicate the degeneracy at 

that position. The sequence in bold was derived from the 
peptide sequence obtained from the N-terminus of the 
sulfonylurea receptor* The remaining 5' sequence was added 
to facilitate subcloning. Primer 1 has a Hindlll site at the 

25 5' end; Primer 2 was engineered with an EcoRI site at the 5' 
end. These primers were used in a standard PCR reaction with 
a random primed cDNA library, constructed in XZAPII using 
mouse a-cell poly A+ mRNA, as template. The following cycle 
times and temperatures were employed: 94 °C for 10 minutes; 

30 85 °C for 3 minutes, [50 °C for 2 minutes, 72 °C for 5 

minutes, 94 °C for 2 minutes,] 50 °C for 2 minutes, 72 °C for 
5 minutes. 

The bracketed conditions were cycled 3 0 times. 
From the N-terminal peptide sequence of the receptor a 47 
35 base pair coding region was expected to be amplified plus the 
20 base pairs added to the primers to facilitate cloning 
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yielding an expected 67 base pair product. The 47 base pair 
coding region was predicted to have 14 base pairs that were 
not present in the primers. The PCR product obtained was 
approximately 67 base pairs and was restricted with EcoRI and 
5 Hindlll, subcloned into M13 and sequenced. The resulting 
sequence gave the expected 14 base pairs indicating the 
sequence was derived from the receptor. The 47 base pair 
oligonucleotide given below was synthesized based on the 
consensus sequence derived from nine M13 clones: 

10 5' TTTTGCGGGACGGAGAATCACTCGGCCGCCTACCGCGTCGACCAAGG-3 ' 

(SEQ ID NO: 12) . This oligonucleotide was used to screen the 
random primed mouse aTC-cell cDNA library. 

A 1.1 kb cDNA was cloned which encoded 28 amino 
acids obtained from peptide sequencing. This cDNA fragment 

15 was used to screen RIN and HIT cell cDNA X libraries to 
obtain full sequence. 

The nucleotide sequence of a 463 5 bp rat receptor 
cDNA is given in Figure 3. The open reading frame encodes a 
1498 amino acid protein with a mass of 167,834 daltons, 

2 0 larger that predicted by SDS polyacrylamide gel 

electrophoresis. Aguilar-Bryan, L. , et al . f JBC , 1990, 
255:8218. The amino acid sequence of the hamster receptor, 
immediately below the rat sequence in Figure 3, is 
approximately 98% identical. There is a single insertion of 

25 an asparagine at position 742 and a deletion of a threonine 
at position 831, The first difference between the hamster 
and rat sequences is in the same relative position, 21 
residues Oterminal of the Walker consensus site, as the 
AF508 deletion seen in a common cystic fibrosis 

30 transconductance regulator (CFTR) mutation (Riordan et al. 
Science 1989 245:1066-1073). 

In addition to the insertion and deletion, the first 
nucleotide binding fold contains approximately a third 
(10/33) of all differences between the two species. 

3 5 The mature rat protein, defined by peptide 

sequencing, begins with a proline following the methionine 
start site. In the RIN cell receptors the adjacent amino 



BNSDOClD: <WO 952841 1A1_I_> 



WO 95/284 U 



PCT/US95/04463 



- 41 - 

acid is a methionine. This is the initiating methionine 
based on the surrounding sequence which is a good fit to the 
consensus pattern for initiation, GCC (A/G) CCAVG (G) (SEQ ID 
NO: 13) (Kozak, M. Cell 1986 44:283), including the strongly 
5 conserved A at position -3. However, in the mouse receptor, 
an additional 35 amino acids is found preceding this proline 
which cannot eliminate the possibility that some forms of the 
hamster and rat receptors have similar leader sequences. 
Confirming the chemical sequence, residue 9 in the mature 
10 proteins is an asparagine within a consensus glycosylation 
site, 

A Blast search of the National center for 
Biotechnology Information (NCBI) nucleotide database with the 
receptor sequence produced matches with several members of 

15 the P-glycoprotein/multidrug resistance protein family. A 
similar search with the amino acid sequence indicated the 
sulfonylurea receptor is a member of the ATP-binding cassette 
superfamily with two putative nucleotide binding domains. 
The sulfonylurea receptor sequence revealed 2 9% similarity, 

20 to an ATP-binding cassette superfamily member, termed a 

multidrug resistance-associated protein (MRP) , isolated from 
a small cell lung carcinoma cell line (H69AR) selected with 
doxorubicin (Cole et al- Science 1992 258:1650-1654) see 
Figure 4. A cluster analysis of this molecule, dvhuar in the 

25 Protein Identification Resource (PIR) database, indicates it 
is related to the leishmania P-glycoprotein-related molecule 
(L,ei/PgpA), the CFTRs (human (Hum/CFTR) , bovine (Bov/CFTR) , 
mouse (Mus/CFTR) , and dogfish (Squ/CFTR)) (Cole et al. 
Science 1992 258 : 1650-1654 ) . A similar result was obtained 

30 for the sulfonylurea receptor with the additional inclusion 
of the Xenopus CFTR indicating the receptor is a member of 
this cluster. 

The identification of the nucleotide binding 
domains goes beyond simply having Walker "A" and M B" 

35 consensus sequences. The receptor is similar to the 230-240 
amino acid nucleotide binding domain (s) described by (Hyde et 
al. Nature 1990 546:362-365) and database searches find 
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similarities to the nucleotide binding fold of ATP-binding 
proteins. The more conserved of the two receptor nucleotide 
binding folds, based on similarity with other ATP-binding 
proteins and the comparison of the rat and hamster sequences, 
5 is at the C-terminal end. An alignment of this region with 
four of the cystic fibrosis transconductance regulator 
(CFTRs) and multidrug resistance proteins (MRPs) with 
similarity to the sulfonylurea receptor are shown in Figure 
5, 

10 RKA Analysis: 

Northern blot analysis of poly A+ mRNA isolated 
from RIN, HIT and otTC-6 cells, previously shown to have the 
high affinity receptor by drug binding and photolabeling 
studies (Aguilar-Bryan et al J, Cell. Biochem. Suppl . 1994 
15 18 A: 13 3) each have an approximately 5000 nucleotide 

transcript, see Figure 6. A preliminary tissue distribution 
study shows the same size transcript is present in mouse 
brain and heart. 

Predicted Protein structure: 

20 Sequence similarities indicate the sulfonylurea 

receptor has two potential ATP binding folds. The size and 
additional sequence similarities with P-glycoproteins and 
CFTRs suggest the receptor has a similar structure. 
Hydrophobic ity (Figure 7) and hydrophobicity versus 

25 hydrophobic moment (Eisenberg et al. J . Mol . Biol. 19S4 

179:125) plots were used to generate a model for the receptor 
(Figure 8) . Two constraints were imposed on the model 
structure: the glycosylation site is on the external face of 
the membrane and both nucleotide binding domains are on the 

30 internal face. The 'classical' ATP-binding cassette 

superfamily model proposes duplication of a unit consisting 
of six transmembrane spanning helices followed by a 
nucleotide binding domain. The sulfonylurea receptor differs 
from this model and has at least nine potential transmembrane 

35 helices before the first nucleotide binding domain but only 
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four between the two nucleotide binding domains (Figure 8) . 
The multidrug resistance-associated protein (MRP) is 
predicted to have 8 transmembrane spanning helices (Cole et 
al. Science 1992 258:1650-1654). 
5 Phosphorylation has been implicated in regulation 

of K ATP channel activity (Schwanstecher et al. J*. Pharmacol, 
Exper. Ther. 1992 262:495-502) and has been proposed to 
change the affinity of the sulfonylurea receptor for various 
ligands. There are 21 potential phosphorylation sites in the 

10 receptor sequence; 3 protein Kinase A (pKA) sites and 18 
protein kinase C (pKC) sites. The pKA site at 278 is 
predicted to be on the external face of the membrane, while 
those at positions 1363 and 1417 are in the second nucleotide 
binding fold. Four of the pKC sites (positions 151, 200, 304 

15 and 1213) are predicted to be extracellular or in a membrane 
spanning helix. Seven of the remaining 14 are in the 
nucleotide binding folds (NBF) ; 4 in NBF-1, and 3 in NBF-2. 
One of the latter sites, Thr 1297 in the Walker A consensus 
site, is expected to alter nucleotide binding if it is 

20 accessible for phosphorylation. 

Functional Properties, In Vitro Translations: 

mRNA, transcribed by SP6 RNA polymerase from the 
rat cDNA subcloned into pGEM4 , was translated in vitro. 
Approximately 0.5 ^g of mRNA was heated to 70 °C for 10 

2 5 minutes, immediately cooled on ice then added to rabbit 

reticulocyte lysate (Promega, Madison, WI) supplemented with 
ribonuclease inhibitor, an amino acid mixture, and 
[ 3S S] methionine. The reaction mixture was incubated at 3 0 °C 
for 60 minutes then aliguots were subject to electrophoresis 

3 0 on SDS polyacrylamide gels using standard protocols. The 

gels were dried and autoradiographed. 

The resulting protein was approximately 137 kDa, 
indicating the receptor behaves anonymously on SDS 
polyacrylamide gels having a faster than expected mobility, 
3 5 see Figure 9a. A similar anomalous behavior has been 

reported for CFTRs (Gregory et al. Mature 1990 347:382-386). 
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Anti-Nucleotide Fold Antibodies Immunoprecipitate the 
Photolabeled 14 0 XDa Receptor: 

Antibodies were produced against two fusion 
proteins containing the two nucleotide binding folds. 
5 Fragments of the receptor cDNA were subcloned in frame into 
pMALc (New England BioLabs, Boston, MA) at the C-terminal end 
of the DNA encoding the maltose binding protein (MBP) . A 
plasmid expressing the first nucleotide binding fold fused to 
MBP was constructed by restricting pMALc with StuI and Sail 

10 and restricting the sulfonylurea receptor cDNA with PvuII 

plus Xhol. A unique 500 base pair fragment was gel purified 
from the receptor cDNA digest and subcloned into pMALc. The 
construction was verified by sequencing. The receptor 
segment expressed is leu708 to leu874. Expression was 

15 obtained in E. coli following transformation and induction by 
isopropylthiogalactoside per the manufacturer's directions. 
The expressed proteins were found to be in inclusion bodies 
which were solubilized in SDS and separated on SDS 
polyacrylamide gels, see Figure 9b. The fusion protein was 

2 0 electroeluted, concentrated, and used as an immunogen. The 

solubilized protein in 2 00 fig amounts, with complete, or 
incomplete Freund's adjuvant, was injected interdermally into 
rabbits using a standard 2-3 week regimen of bleeding and 
boosting. 

25 Injection of Xenopus Oocytes with Receptor mRNA: 

HiRNA, approximately 50 ng, transcribed as described 
above, was injected into Xenopus oocytes. The injected 
oocytes were assayed for K+ channel activity after 1-5 days 
using both two-electrode and patch clamp methods. New K+ 

3 0 currents in the injected oocytes were not detected. 

Similarly, co-injection of mRNAs transcribed from cDNAs 
encoding two small inward rectifiers, R0MK1 (Ho et al. Nature 
1993 362:31-38) or a brain homolog of IRK1 (Kelly et al. 
Biophysical J\ 1994 66 (2) :A109 ) failed to confer sulfonylurea 
3 5 sensitivity on these K-f channels. The results suggest that 
the 140 kDa receptor does not have intrinsic K+ channel 
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activity , or that Xenopus oocytes are not an adequate 
background for their expression. 

Trans f action Experiments: 

The sulfonylurea receptor cDNA has been ligated 
5 into eukaryotic expression vectors containing SV4 0 virus , 
adenovirus and cytomegalovirus (CMV) promoters. These 
plasmids have been transfected into COS cells which do not 
have the high affinity sulfonylurea receptor as determined by 
filtration binding and photolabeling studies. To date 

10 experiments with the SV4 0 plasmid have shown that the 

transfected cells produce an mRNA of the appropriate size as 
determined by Northern blots with receptor cDNA. Metabolic 
labeling experiments with the SV40 plasmid where transfected 
and non-transfected cells were labeled with [ 35 S] methionine 

15 indicate that the transfected, but not the non-transfected 
cells, synthesize an appropriate sized protein which can be 
immunoprecipitated with the antinucleotide binding fold 
antibodies. The level of receptor synthesized by COS cells 
using this promoter has been low using SEQ ID NOS: 4 and 5. 

20 Expression levels are high using SEQ ID NOS: 32, 33, 35, and 
36 from rat and hamster. 

Chromosomal localization of the Sulfonylurea Receptor Gene 

Chromosomal localization of the Sulfonylurea 
Receptor (SUR) gene to normal male human banded chromosomes 

25 was determined by utilization of the fluorescence in situ 
hybridization (FISH) technique by staining with 4,6- 
diamidino-2-phenylindole (DAPI) . A raetaphase spread showed 
the two chromosome li homologues which map the SUR cDNA to 
llpl5,l. Overlapping human SUR cDNA plasmids "mid" and "3", 

30 totaling 3.8 kb, were labeled with biotin*-14-dATP (GIBCO) and 
hybridized in situ to standard metaphase spreads from normal 
male peripheral blood lymphocytes, according to the methods 
of P. Lichter et al., Science 247, 64 (1990), the disclosure 
of which is hereby incorporated by reference in its entirety. 

35 The biotin-labeled DNA was detected using Fluorescein-Avidin 
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DCS (Vector Laboratories, Burlingame, California) . 
Chromosomes were identified by simultaneous DAPI staining, 
which produces a Q-banding pattern. Fifteen metaphases were 
analyzed. Digital images were obtained with a cooled charge- 
5 coupled device camera mounted on a standard epif luorescent 
microscope (Axioplan; Zeiss, Thronwood, New York) , Images 
were acquired using the software ISee (Inovision Co.) running 
on a Sun workstation. Fluorescein isothiocyanate and DAPI 
fluorescence were recorded separately as gray scale images 

10 and then merged using the software package NIH 1.55 (J.W. 

Ijdo, E . A. Lindsay, R.A. Wells, A. Baldini, Genomics 14, 1019 
(1992)). Eighty-five per cent of metaphases analyzed showed 
specific hybridization signal on both chromatids of the two 
chromosomes 11 at llpl5.1. 

15 Partial cDNA clones, comprising 3.8 kb of coding 

sequence of the human homologue of SUR, were obtained from a 
human pancreatic cDNA library (provided by Graeme Bell, 
University of Chicago, and commercial libraries of Clontech, 
Palo Alto, CA and Invitrogen, San Diego, CA) . The library 

2 0 was produced in lambda gtlO phage (Bell RIN library) and 

screened with a 2294 bp hamster cDNA probe encoded by SEQ ID 
NO: 31. 

The protocol for making the library is provided by 
Sambrook et al., supra. Poly A+ mRNA was isolated using an 

25 oligo dT column. Poly A-f mRNA was incubated with oligo dT 
and random hexamers plus reverse transcriptase (such as MMLV 
RT from Promega, Stratagene or NEBL) and dNTPs to produce 
single strand cDNA. The single strand cDNA is treated with 
E. coli DNA polymerase, RNAseH and dNTPs, then ligated to 

30 linkers that have EcoRI sites to produce double stranded DNA. 
The final product is restricted with EcoRI and ligated, using 
T4 DNA ligase, into lambda phage DNA that has been similarly 
restricted and dephosphorylated with alkaline phosphatase to 
prevent self ligation. The ligated product is packaged into 

3 5 phage using commercially available packaging extracts. 

Screening involved plating and hybridizing at 55°C 
or 65°C in 5X or 6X SSC (according to the methods of 
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Sambrook, et al . ) . 55°C was used for cross species screens 
and 65°C was employed for the same species. Two washes were 
carried out at room temperature using 2X SSC, then one at the 
hybridization temperature of 65°C using 0.1X SSC* 
5 Hybridizations and washes were done at reduced 

stringency (55 °C) using methods according to F.M. Ausubel et 
al.. Current Protocols in Molecular Biology (Greene 
Publishing Associates, Inc, New York, NY, 1989), Chap. 6, the 
disclosures of which are hereby incorporated by reference in 

10 their entirety. Subsequent screening was done at higher 
stringency (65°C) , using a human cDNA of SEQ ID NO: 29 
obtained from the first screen as a probe. 

Characterization of these cDNA clones by sequence 
analysis revealed an overall homology of 95% with the rat SUR 

15 gene. A specific hybridization signal was detected at the 
band llplS.l in 85% of metaphases on both chromatids of the 
two chromosomes 11* 

Detection of Sulfonylurea Receptor Mutations 
in PHHI Affected Individuals 

20 Mutational analysis was performed on samples from 

16 affected progeny of nine consanguineous matings. In each 
case, diagnosis of PHHI was based on criteria established by 
A. Aynsley-Green et al., supra., the disclosure of which is 
hereby incorporated by reference in its entirety* The 

25 parents in six families were first cousins, in two families 
second cousins, and in one family more distantly related. 
Eight families were of Saudi Arabian origin, recruited from 
the patient population of the Arabian American Oil Company 
Hospital Medical Services Organization, after institutional 

3 0 approval was received, and one was of Germanic origin. 
Family labels follow the form of Thomas et al., supra. 

Studies indicated that no major insertions or 
deletions of the SUR locus had occurred in three of the 
families. The first region evaluated, by direct sequence 

35 analysis, was the second nucleotide binding fold (NBF-2) of 
the human SUR homologue (Figure 10) . This is the most highly 
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conserved region of the SUR gene, and in other superfamily 
members it, as well as NBF-l, has functional importance for 
control of channel activity through interaction with 
cytosolic nucleotides* S.C. Hyde, Nature 346, 362 (1990) and 
5 M.J. Weish r A.E. Smith, Cell 73, 1251 (1993), 

To obtain this genomic structure, a normal human 
lymphocyte genomic bacteriophage library (provided by Mary 
Beth Humphrey, Baylor College of Medicine) was screened, 
using standard methods according to P.M. Ausubel et al. ; 

10 supra., with a human partial SUR cDNA probe of SEQ ID NO: 29 
(cDNA probe, "3 prime") . 

The human genomic library was made in lambda FIX 
using materials supplied by Stratagene, Inc. Briefly, 
genomic DNA was partially digested with Sau3A, the fragments 

15 were precipitated with ethanol, resuspended with precut 

lambda FIX DNA which has compatible ends, ligated with T4 DNA 
ligase and packaged and screened. 

Hybridizations and washes were done at reduced 
stringency (55°C) using methods according to F.M. Ausubel et 

20 al., Current Protocols in Molecular Biology (Greene 

Publishing Associates, Inc, New York, NY, 1989), Chap. 6, the 
disclosures of which are hereby incorporated by reference in 
their entirety. The library was screened with a 1.2 kb 
hamster cDNA probe of SEQ ID NO: 30, which spans the SUR NBF2 

25 sequence. Subsequent screening was done at higher stringency 
(65°C), using a human cDNA of SEQ ID NO: 29 obtained from the 
first screen as a probe. Screening involved plating and 
hybridizing at 55°C or 65 °C in 5X or 6X SSC (according to the 
methods of Sambrook, et al.). 55°C was used for cross 

3 0 species screens and 65 °C was employed for the same species. 
Two washes were carried out at room temperature using 2X SSC, 
then one at the hybridization temperature of 65° C using 0.1X 
SSC ♦ 

Inserts in the bacteriophage clone XG4 were 
35 subcloned into pBluescript 11 (Stratagene, La Jolla, 

California) . Plasmids were purified using standard cesium 
chloride purified methods, restricted using the appropriate 
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desired enzyme (s) . The fragments were purified by 
electrophoresis on low melt agarose and cut out of the gel, 
A l-to-5 microliter aliquot of the desired fragment and 1 
microgram of the appropriately restricted plasmid carrying a 
5 selectable ampicillin resistance marker (such as pBluescript 
from Stratagene, Inc.) were melted at 65°C f mixed and diluted 
to 2 0 microliters with a buffer containing T4 DNA ligase and 
ATP, then incubated for 4-18 hours before transforming into 
JS. coli and selecting on ampicillin plates. 

10 Exon-intron boundaries were defined by comparing 

the nucleotide sequences of the human SUR gene and cDNA, 
which were obtained using the dideoxy chain termination 
method (Sequenase; U.S, Biochemicals, Cleveland, Ohio). 

Because of the consanguineous matings and autosomal 

15 recessive inheritance pattern of this disorder, affected 

individuals are expected to be homozygous by descent at the 
disease gene locus. E.S. Lander and D. Botstein, Science 
236, 1567 (1987) , the disclosure of which is hereby 
incorporated by reference in its entirety. Direct sequencing 

20 of a pancreatic cDNA product, isolated from an affected child 
of Family 6, revealed a 109 bp deletion within the NBF-2 
region which corresponded to skipping of an exon resulting in 
a cDNA product of about 2190 bp in length using primers of 
SEQ ID NOS: 2 0 and 21 as compared to mRNA of about 2 080 bp in 

25 length. The effects of this skipping event are severe and 
include production of a frameshift, premature truncation of 
the protein due to inclusion of a stop 24 codons later, and 
disruption of the NBF-2 (Figure 11A and 11B) . The splice 
sites of the skipped exon were evaluated at the genomic DNA 

3 0 level and a homozygous G to A point mutation, located within 
the 5' splice site at the last base of the skipped exon, was 
found (Figure 11C) . A recognition site for the restriction 
endonuclease MspJ is destroyed by this base change, providing 
a means to confirm and test for the presence of the mutation. 

35 mRNA was directly isolated using Oligotex (Qiagen Inc., 
Studio City, California) from a fresh-frozen pancreatic 
tissue sample and reverse transcribed (RT) , using random 
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primers (Invitrogen, San Diego, California) , with Superscript 
11 (GIBCO-BRL) into cDNA. For cloning of the NBF-2 region, 
an initial PCR amplification with SEQ ID NOS: 2 3 (primer 22 
(located 5' of 17)) and 19 (primer 29) was followed by a 
5 second amplification of a portion of the reaction with SEQ ID 
NOS: 16 (primer 17) and 19 (primer 29) using conditions 
described by P.M. Thomas, G.J. Cote, D *M. Hallman, P.M. 
Mathew, Am. J. Hum. Genet., in press, supra.. 

PCR products were amplified using hybridization at 

10 60°C for 1 minute, elongated at 72 °C for 1 minute and 
denatured at 93 °c for 1 minute for thirty cycles. 
Hybridization may be carried out at temperatures of between 
about 55°C to about 65°C. The amplified product was cloned 
into pCR 11™ vector (Invitrogen, San Diego, CA) and 

15 sequenced, as above. pCR 11™ vector is set forth in Figure 
13. The vector was purified using a standard cesium chloride 
method, restricted using the appropriate desired enzyme (s), 
the fragments were purified by electrophoresis on low melt 
agarose and cut out of the gel. A l-to-5 microliter aliquot 

20 of the desired fragment and 1 microgram of the appropriately 
restricted plasmid carrying a selectable ampicillin 
resistance marker (such as pBluescript from Stratagene, Inc.) 
were melted at 65 °C, mixed and diluted to 2 0 microliters with 
a buffer containing T4 DNA ligase and ATP, then incubated for 

25 4-18 hours before transforming into E. coli and selecting on 
ampicillin plates. For detection of the mutation in 

genomic fragments, 100 ng of genomic DNA was amplified using 
SEQ ID NOS; 18 and 20, primers 2 8 and 2 9B, as above except in 
the presence of PCR buffer N (Invitrogen, San Diego, CA) , and 

3 0 either directly PCR sequenced according to the methods of S. 
Khorana, R.F. Gagel, G.J. Cote, Nucleic Acids Res. 22, 3425 
(1994) , the disclosure of which is hereby incorporated herein 
by reference in its entirety, or cut with 5 U of Mspl (GIBCO- 
BRL) at 37 °C for 2 hours and run on a 10% polyacrylamide gel. 

3 5 Visualization of products was by silver staining. Both 
affected children of Family 6 were homozygous, while the 
parents and two unaffected siblings were found to be 
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heterozygous, for the mutation (Figure 11D) . Preliminary 
semiquantitative analysis revealed markedly decreased 
expression of the mutant SUR message upon comparison of 
patient and age-matched normal control pancreatic samples, 
5 suggesting instability of the mutant message. 

Thirteen additional affected children, from six 
families of Saudi Arabian origin and one family of German 
origin, were found to be homozygous for this mutation, as 
demonstrated by loss of the Mspl restriction enzyme 

10 recognition site. In all families, homozygous loss of the 
Mspl site cosegregated with disease phenotype, and in 
Families 1-3 and 5 genotype analysis for this mutation agreed 
with previously reported haplotype data, P.M. Thomas, G.J. 
Cote, D.M. Hallman, P.M. Mathew, Am. J. Hum* Genet., in 

15 press, supra. Direct sequencing of PCR-amplif ied genomic DNA 
from a representative affected member of each family 
determined that all exhibited the homozygous G to A mutation. 

Genomic DNA from affected and normal individuals 
was PCR-amplif ied using the SEQ ID NOS: 16 and 19 and cloned 

20 into pRSVhMT2A. Plasmids were purified using standard cesium 
chloride methods, restricted using the appropriate desired 
enzyme (s) . The fragments were purified by electrophoresis on 
low melt agarose and cut out of the gel. A l-to-5 microliter 
aliquot of the desired fragment and 1 microgram of the 

25 appropriately restricted plasmid carrying a selectable 
ampicillin resistance marker (such as pBluescript from 
Stratagene, Inc.) were melted at 65°c, mixed and diluted to 
20 microliters with a buffer containing T4 DNA ligase and 
ATP, then incubated for 4-18 hours before transforming into 

30 E . coli and selecting on ampicillin plates. 

Constructs were transfected into the human 
glioblastoma cell line SNB 19 using Lipof ectamine™ (Gibco- 
BRL, Gaithersburg, MD) . RT-PCR analysis was performed, with 
SEQ ID NOS: 18 (primer 16) and 23 (primer DS8) , as described 

35 by H. Lou, G.J. Cote, R.F. Gagel, Mol . Endo . 8, 1618 (1994), 
the disclosure of which is incorporated herein by reference 
in its entirety. The plasmids and their cDNA products were 
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sequenced with SEQ ID NO: 17 (primer 34al) . Genomic DNA 
fragments were PCR-amplif ied with SEQ ID NOS; 17 and 18 
(primers 34al and 16) and digested with Neil, as in Figure 
11. 

5 Family 4 demonstrated a unique mutation in the 3' 

splice site sequence preceding the start of the NBF-2 (Figure 
12A) . This G to A mutation destroys an Neil restriction 
endonuclease site and homozygous loss of this site 
cosegregated with disease phenotype within the family, 

10 Again, genotype analysis of the members of this family 

supported previously reported haplotype data, P.M. Thomas, 
G.J. Cote, D.M. Hallman, P.M. Mathew, Am. J. Hum. Genet*, in 
press, supra,; both parents are heterozygotes for the 
mutation and the unaffected sibling is homozygous for the 

15 wild type allele (Figure 12B) . Since a pancreatic tissue 
sample from an affected individual in Family 4 was 
unavailable and we were unable to recover the SUR message 
from transformed lymphocytes, a chimeric construct was 
created to examine the effects of this mutation on the RNA 

2 0 splicing pathways according to the methods of R. Takahashi, 

et al., Nature Genet. 7, 79 (1994); I. Satokata, et al., 
Proc, Natl. Acad. Sci . 87 9908 (1990); H. Lou, G.J. Cote, 
R.F. Gagel, Mol . Endo. 8, 1618 (1994), the disclosure of each 
hereby incorporated by reference in its entirety. With the 

25 construct containing the mutation, no wild type splicing 
pattern occurred. Instead, use of three cryptic 3' splice 
sites was demonstrated resulting in a 7 bp addition, a 20 bp 
deletion, and a 30 bp deletion in the exon (Figure 12D) . A 
similar intronic 3' splice acceptor mutation, described in 

30 the disorder 21-hydroxylase deficiency, also resulted in lack 
of the wild type splicing pattern, produced several cryptic 
splice products, and abolished normal protein activity, Y. 
Higashi, et al., Proc, Natl. Acad. Sci., USA 85, 7486 (1988), 
the disclosure of which is incorporated herein by reference 

3 5 in its entirety. 

All PCR products prepared from genomic DNA of 100 
normal, unrelated individuals showed normal Mspi and Neil 
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restriction patterns, indicating that neither mutation is a 
common polymorphism. The data presented provides evidence 
that mutations in the SUR gene cause familial persistent 
hyperinsulinemic hypoglycemia of infancy. 
5 Various modifications of the invention in addition 

to those shown and described herein will be apparent to those 
skilled in the art from the foregoing description. Such 
modifications are also intended to fall within the scope of 
the appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
<i) APPLICANT: 

(ii) TITLE OF INVENTION: Sequence Encoding Mammalian Sulfonylurea Receptor 
and Method of Detecting Persistent Hyper insulinemic Hypoglycemia of 
Infancy 

(iii) NUMBER OF SEQUENCES: 37 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Woodcock Washburn Kurtz Mackiewicz & 

Norris 

(B) STREET: One Liberty Place 46th, Floor 

(C) CITY: Philadelphia 

(D) STATE: PA 

(E) COUNTRY : USA 

(F) ZIP: 19103 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(Vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : PCT/US95/04463 

(B) FILING DATE: 12-APR-1995 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION : 

(A) NAME: Beardell, Lori Y. 

(B) REGISTRATION NUMBER: 34,2 93 

(C) REFERENCE /DOCKET NUMBER: BYLR-0004 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 215-568-3100 
CB) TELEFAX: 215-568-3439 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
ATGCCTTTGG CCTTCTGCGG CACCGAGAAC CACTCGGCCG CCTACCGGGT GG AC CAAGGC 6 0 

GTCCTCAACA ACGGCTGCTT CGTGGACGCG CTCAATGTGG TGCCACATGT CTTTCTGCTC 12 0 
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•TTrATrarfT 

X X \_-^V J- v— X 


TCCCCATCCT 


CTTCATCGGA 


TG C3 G G C AG C T 


AGAGCTCCAA 


GGTGCACATT 


XS 0 




CCTGGCTCCA 


TTTCCCGGGG 


CACAACCTGC 


GCTGGATCCT 


GACCTTCATA 


2 4 0 


rTGCTCTTCG 


TCCTCGTGTG 


TGAGATCGCT 


GAGGGTATCC 


TGTCTGACGG 


GGTGACAGAA 


3 00 


TCCCGCCACC 


TCCACTTATA 


CATGCCAGCT 


GGGATGGCAT 


TCATGGCTGC 


CATCACCTCT 


360 


GTGGTCTACT 


AC C AT AAC AT 


T GAG AC CT CT 


AACTTTCCCA 


AGCTGCTGAT 


TGCTCTGCTC 


420 


ATCTACTGGA 


CCCTGGCCTT 


CAT C ACGAAG 


AC CAT C AAGT 


TCGTCAAGTT 


CTACGACCAC 


480 


GCCATTGGCT 


TCTCTCAGCT 


GCGCTTCTGC 


CTCACGGGGC 


TTCTGGTGAT 


CCTCTACGGG 


540 


ATGCTGCTGC 


TTGTGGAGGT 


CAATGTCATC 


CGGGTGAGGA 


GATACNTCTT 


CTTCAAGACA 


600 


CCAAGGGAAG 


TAAAGCCCCC 


CGAGGACCTA 


CAGGACCTGG 


GTGTGCGCTT 


TCTGCAGCCC 


660 


TTCGTTAACC 


TGCTATCAAA 


GGGGACCTAC 


TGGTGGATGA 


ATGCCTTCAT 


CAAGACTGCT 


720 


CACAAGAAGC 


CCATCGACCT 


GCGGGCCATC 


GNGAAGCTGC 


CCATTGCCAT 


GAGAGCCCTC 


780 


ACCAACTACC 


AGCGACTCTG 


CNTGGCCTTC 


GATGCCCAGG 


CGCGGAAGGA 


CACACAGAGC 


B40 


CNGCAGGGTG 


CCCGGGCCAT 


CTGGAGGGCT 


CTCTGTCATG 


CCTTTGGGAG 


ACGGCTGGTC 


900 


CTCAGCAGCA 


C ATTC CGTAT 


CCTGGCCGAC 


CTCCTGGGCT 


TTGCTGGGCC 

X X WW X VJwsJ\« ^ 


ACT CTG CATC 


960 


TTnGGGATPG 


TGGACCArTT 


CGGGAAGGAG 




TCCAGCCCAA 


GACACAGTTT 


102 0 


V-» A A V_fl?-rVO A J. X 


x x ivji^is- 


ATC C C AAGAG 


TTCCTCGGCA 


ATGCCTATGT 


CTTGGCTGTT 


1080 


\- X X *w -i. VJ A X *w 


TTGCCCTCCT 


GCTGCAAAGG 


ACCTTTCTAC 


AAGCCTCGTA 


CTACGTTGCC 


1140 


ATTGAAACTG 


GGATCAACCT 


GAG AG GAG CA 


ATCCAGACCA 


AGATTTACAA 


TAAGATCATG 


1200 




fTTrrAATPT 

LI 1U _,/tlt^_. l_ x 


\J X UUrt X \3\J\J\J 


GAAATGACTG 


GTGGGCAG AT 


CTGCAACCTG 


126 0 


Ls x UuLl-Al L-Lj 


ALnL L. AAL- L. A 


rr"T i r i 'h r vnT{ r zrz 

Lj L- i LHiy 1 O^vJ 


TTTTTCTTCT 


TATGPPPA A A 


rrTPTGfswr'T 


132 0 


ATGCL.GGTAC 


AG AT C ATI G I 


L^GLjC-Cj I GA1 L. 


L. 1 LL. 1 L. iAC I 


ALAi LLI 1 Lit? 


(jjLj I CALiL-LtUC. 


1 T OA 
1 J D U 


TTGATTGGAG 


CGGCTGTCAT 


LAxx L, IGLlb 


CjC I LC i Lj I 


ALt 1AL I 1 I (a I 


LioL, UAL- L.AAL? 


144 0 


L, X Li H_L.t-A.tjUt 


L» AL. ALt L-Li Ax, 




lril X v^jf-i-f-i. X va 


AGAGGPTGA A 


GPAGAPPAAT 


15 0 0 


L?ALxAa LiL- 1 L.L. 


LrvjrLjLfi-AxINAA. 




\— X \- X X VJ X 


GGGAGAAPAT 


PTTPTGCTCr 


1560 




■rtvj/"VV*rt VJ v— .rV— • 


C^ZlAGGAAATG 


ACCAGCCTCA 


GGGCCTTCGC 


TGTCTACACC 


1620 


t r* r a t r* t c c a 


x \_ x x V— x wru^ 


CACAGCTATC 


CCCATCGCTG 


CTGTCCTCAT 


CACCTTCGTG 


1680 


*J A^w X V^^V 


GCTTCTTCAA 

u w X X V- X X w/Vi 


AG AGT C GG AC 


NTCTCGCCCT 


CGGTGGCCTT 


TGCCTCTCTC 


1740 


rp ^-i r—i i-» rp m rm rri r- 1 (~t 

A L, A L- i L, I X LL 


AL.A ILLi oo i 






r*TAGTGTGGT 

\^ X riu X X V3 V3 X 




1800 


GTCAAGGCCC 


TGGTGAGCGT 


GCAAAAGCTG 


AGTGAGTTCC 


TGTCCAGTGC 


AGAGATCCGT 


1860 


GAGGAACAGT 


GTGCCCCCCG 


AGAGCCCGCA 


CCCCAAGGCC 


AAGCGGGCAA 


GTACCAGGCG 


1920 


GTGCCCCTCA 


AGGTCGTAAA 


CCGCAAGCGC 


CCAGCCCGAG 


AAGAAGTCCG 


GGACCTCTTG 


1980 


GGCCCACTGC 


AGAGGCTGNC 


TCCCAGCAKG 


GATGGAGACG 


CTGACAACTT 


CTGTGTCCAG 


2040 


ATCATCGGAG 


GCTTCTTCAC 


CTGGACCCCT 


GATGGAATCC 


CCACCCTGTC 


CAACATCACC 


2100 


ATCCGTATCC 


CCCGAGGTCA 


GCTGACCATG 


ATCGTGGGGC 


AGGTGGGCTG 


TGGCAAGTCC 


2160 
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TGGCTCCTTC 


TGGCCACCCT 


GGGGG AG AT G 


CAGAAGGTCT 


CTGGAGCTGT 


CTTCTGGAAC 


2220 


AGCCTTCCAG 


ACAGCGAGGG 


G AG ANG AC C C 


CAGCAACCCA 


GAGCGGGAGA 


CAGCNGCNGN 


2280 


NTCGGATNCC 


AGGAGCAGAG 


GCCCNGTGGC 


TA CG CATC TC 


AGAAACCATG 


GCTGCTAAAT 


2340 


GCCACTGTGG 


AGGAGAACAT 


CACCTTCGAG 


AGTCCCTTCA 


ATNNGCAACG 


GTACAAGATG 


2400 


GTCATCGAAG 


CCTGCTCCCT 


GCAGCCAGAC 


ATAGACATCC 


TGCCCCATGG 


AGACCAGACT 


2460 


CAGATTGGGG 


AACGAGGCAT 


CAACTTGAGT 


ACTGGTGGTC 


AGCGTCCAGA 


TCAGTGTNGA 


2520 


CCCGAGCCCT 


CTACCAGCAN 


ACCAATGNTT 


GTCTTTTTGG 


ATGACCCTTT 


CTCGGCTCTG 


2580 


GATGTCCATC 


TGAGTGACCA 


CCTAATGCAG 


GCTGGCATCC 


TCGAGCTGCT 


CCGGGATGAC 


2640 


AAGAGGACAG 


TGGTCTTGGT 


GACCCACAAG 


CTACAGTACC 


TGCCTCATGC 


TGACTGGATC 


2700 


ATTGCTATGA 


AGGATGGCAC 


CATTCAGAGG 


GAGGGGACAC 


TCAAGGACTT 


CCAGAGGTCT 


2760 


GAGTGCCAGC 


TCTTTGAGCA 


TTGGAAGACC 


CTCATGAACC 


GGCAGGACCA 


AGAGCTGGAG 


2820 


AAGGAGACAG 


TCATG G AG AG 


AAAAGCCNCN 


GAGCCATCTC 


AGGGCCTGCC 


CCGTGCCATG 


2880 


TCCTCAAGAG 


ATGGCCTTCT 


GCTGGATGAG 


GANGAGGAGG 


AAGAGGAGGC 


AGCCGAGAGC 


2940 


GAGGAAGATG 


ACAACTTATC 


CTCTGTGCTG 


CATCAGCGAG 


CCAAGATCCC 


ATGGCGAGCC 


3000 


TGCACCAAGT 


ATTTGTCCTC 


TGCTGGCATC 


CTGCTCCTGT 


CCCTGCTTGT 


CTTCTCCCAG 


3060 


CTGCTCAAGC 


ACATGGTCTT 


GGTGGCCATT 


GACTACTGGC 


TGGCCAAGTG 


GACGGACAGT 


3120 


GCCCTGGTCC 


TGAGCCCCGC 


CGCCAGGAAC 


TGCTCCCTCA 


GCCAGGAATG 


TGNCCTGGAC 


3180 


CAATCTGTCT 


ATGCCATGGT 


ATTCACCNTG 


CTCTGCAGCC 


TGGGTATCGN 


GCTGTGCCTT 


3240 


GTCACCTCTG 


TCACTGTGGA 


GTGGACGGGA 


CTGAAGGTGG 


CCAAGAGGCT 


GCATCGCAGC 


3300 


CTGCTCAACC 


GTATCATCCT 


GGCTCCCATG 


AGGTTCTTTG 


AGACCACGCC 


CCTGGGGAGT 


3360 


ATCCTGAACA 


GATTTTCATC 


TGACTGTAAC 


ACCATTGACC 


AGCATATCCC 


GTCCACGCTG 


3420 


GAGTGCCTGA 


GCAGATCCAC 


CTTACTCTGT 


GTCTCCGCCC 


TGKCTGTCAT 


CTCCTACGTC 


3480 


ACGCCTGTGT 


TCCTAGTGGC 


CCTCTTACCC 


C'TCGCCGTCG 


TGTGCTACTT 


CATCCAGAAG 


3540 


TACTTCCGAG 


TGGCGTCCAG 


GGACCTGCAG 


CAGCTGGACG 


ACACAACACA 


GCTCCCTCTG 


3600 


NTCTCACACT 


TTGCTGAAAC 


TGTGGAAGGA 


CTCACCACCA 


TCCGTGCCTT 


C AG G TACGAG 


3660 


GCCCGGTTCC 


AGCAGAAGCT 


CCTAGAGTAC 


AC CG ACT CCA 


ACAACATTGC 


CTCTCTCTTC 


3720 


CTCACAGCAG 


CCAACAGGTG 


GCTGGAAGTC 


NGCATGGAGT 


ACATCGGAGC 


ATGCGTGGTA 


3780 


CTCATCGCCG 


CTGCCACCTC 


CATCTCCAAC 


TCCCTACACA 


GGGAGCTCTC 


AGCCGGCCTA 


3840 


GTAGGCCTGG 


GCCTCACCTA 


TGCCTTGATG 


ATTGGGATCT 


GCGGCCGCAC 


AGGCAGTGGA 


3900 


AAATCCTCCT 


TCTCTCTCGC 


CTTTTTCCGA 


ATGGTGGATA 


TGTTTGAAGG 


GCGTATCATC 


3960 


ATCGATGGCA 


TTGACATCGC 


CAAGCTGCCG 


CTGCACACGC 


TCCGCTCACG 


CCTGTCTATC 


4020 


ATCCTACAGG 


ACCCTGTTCT 


CTTCAGTGGT 


AC CATC AG AT 


TCAACCTGGA 


CCCAGAGAAG 


4080 


AAATGCTCAG 


ACAGCACGCT 


GTGGGAGGCT 


CTGGAGATCG 


CTCAGCTGAA 


GCTGGTGGTG 


4140 


AAGGCCCTGC 


CAGGAGGCCT 


GGATGCCATC 


ATCACGGAAG 


GAGGGGAGAA 


TTTTAGCCAG 


4200 
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GGCCAGAGGC AGCTGTTCTG CCTGGCCCGG GCCTTTGTGA GGAAGACCAG CATCTTCATC 42 60 

ATGGATGAAG CAACTGCCTC CATCGACATG GCTACGGAAA ATATCCTCCA GAAGGTGGTG 432 0 

ATGACAGCCT TCGCAGACCG CACCGTGGTC ACCATCGCGC ACCGCGTGCA CACCATCCTG 43 8 0 

AGTGCAGACC TAGTGATGGT CCTGAAGAGG GGCGCGATCC TGGAGTTCGA CAAGCCGGAA 444 0 

ANGCTTCTCA GCCAGAAGGA CAGCGTCTTT GCCTCCTTTG TCCGCGCGGA CAAATGACCA 4 5 00 

GCCAGCGCCA AAGTGCCACC CCACACCTCA CCTGCTTGCC ATGGATTTCT TACTGTAAAT 4 560 

CACTTGTAAA TAAAGAAACT AATTCTTTGC TAAAAAAAA 4 5 95 
(3) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
GCGCGGAGCC GGAGCCGAGC CCGTGCGCGC GCCACC 36 
(4) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 3 : 
AGCCGAGCCC GTGCGCGCGC CGCC 24 
(5) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : cDNA 



(iii) HYPOTHETICAL : NO 



(iv) ANTI - SENSE : NO 
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(v) FRAGMENT TYPE: N- terminal 

(XX J FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 37.. 4533 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GCGCGGAGCC GGAGCCGAGC CCGTGCGCGC GCCACC ATG CCT TTG GCC TTC TGC 54 

Met Pro Leu Ala Phe Cys 
1 5 

GGC ACC GAG AAC CAC TCG GCC GCC TAC CGG GTG GAC CAA GGC GTC CTC 102 
Gly Thr Glu Asn His Ser Ala Ala Tyr Arg Val Asp Gin Gly Val Leu 
10 15 20 

AAC AAC GGC TGC TTC GTG GAC GCG CTC AAT GTG GTG CCA CAT GTC TTT 150 
Asn Asn Gly Cys Phe Val Asp Ala Leu Asn Val Val Pro His Val Phe 
25 30 35 

CTG CTC TTC ATC ACC TTC CCC ATC CTC TTC ATC GGA TGG GGC AGC CAG 198 
Leu Leu Phe lie Thr Phe Pro lie Leu Phe lie Gly Trp Gly Ser Gin 
40 45 50 

AGC TCC AAG GTG CAC ATT CAC CAC AGC ACC TGG CTC CAT TTC CCG GGG 246 
Ser Ser Lys Val His lie His His Ser Thr Trp Leu His Phe Pro Gly 
55 60 65 70 

CAC AAC CTG CGC TGG ATC CTG ACC TTC ATA CTG CTC TTC GTC CTC GTG 294 
His Asn Leu Arg Trp lie Leu Thr Phe lie Leu Leu Phe Val Leu Val 
75 80 85 

TGT GAG ATC GCT GAG GGT ATC CTG TCT GAC GGG GTG ACA GAA TCC CGC 342 
Cys Glu lie Ala Glu Gly lie Leu Ser Asp Gly Val Thr Glu Ser Arg 
90 95 100 

CAC CTC CAC TTA TAC ATG CCA GCT GGG ATG GCA TTC ATG GCT GCC ATC 3 90 

His Leu His Leu Tyr Met Pro Ala Gly Met Ala Phe Met Ala Ala lie 
105 110 115 

ACC TCT GTG GTC TAC TAC CAT AAC ATT GAG ACC TCT AAC TTT CCC AAG 438 
Thr Ser Val Val Tyr Tyr His Asn lie Glu Thr Ser Asn Phe Pro Lys 
120 125 130 

CTG CTG ATT GCT CTG CTC ATC TAC TGG ACC CTG GCC TTC ATC ACG AAG 486 
Leu Leu He Ala Leu Leu He Tyr Trp Thr Leu Ala Phe He Thr Lys 
135 140 145 150 

ACC ATC AAG TTC GTC AAG TTC TAC GAC CAC GCC ATT GGC TTC TCT CAG 534 
Thr He Lys Phe Val Lys Phe Tyr Asp His Ala He Gly Phe Ser Gin 
155 160 165 

CTG CGC TTC TGC CTC ACG GGG CTT CTG GTG ATC CTC TAC GGG ATG CTG 582 
Leu Arg Phe Cys Leu Thr Gly Leu Leu Val He Leu Tyr Gly Met Leu 
170 175 180 

CTG CTT GTG GAG GTC AAT GTC ATC CGG GTG AGG AGA TAC GTC TTC TTC 63 0 

Leu Leu Val Glu Val Asn Val He Arg Val Arg Arg Tyr Val Phe Phe 
185 190 195 

AAG ACA CCA AGG GAA GTA AAG CCC CCC GAG GAC CTA CAG GAC CTG GGT 678 
Lys Thr Pro Arg Glu Val Lys Pro Pro Glu Asp Leu Gin Asp Leu Gly 
200 205 210 

GTG CGC TTT CTG CAG CCC TTC GTT AAC CTG CTA TCA AAG GGG ACC TAC 726 



SUBSTITUTE SHEET (RULE 26) 



BMSDOCID: <WO__952841 1 A1 J_> 



WO 95/28411 



PCT/13S95/04463 



59 

Val Arg Phe Leu Gin Pro Phe Val Asn Leu Leu Ser Lys Gly Thr Tyr 
215 220 225 230 

TGG TGG ATG AAT GCC TTC ATC AAG ACT GCT CAC AAG AAG CCC ATC GAC 7 74 

Trp Trp Met Asn Ala Phe lie Lys Thr Ala His Lys Lys Pro lie Asp 
235 240 245 

CTG CGG GCC ATC GGG AAG CTG CCC ATT GCC ATG AGA GCC CTC ACC AAC 822 
Leu Arg Ala lie Gly Lys Leu Pro He Ala Met Arg Ala Leu Thr Asn 
250 255 260 

TAC CAG CGA CTC TGC TTG GCC TTC GAT GCC CAG GCG CGG AAG GAC ACA 8 70 

Tyr Gin Arg Leu Cys Leu Ala Phe Asp Ala Gin Ala Arg Lys Asp Thr 
265 270 275 

CAG AGC CAG CAG GGT GCC CGG GCC ATC TGG AGG GCT CTC TGT CAT GCC 918 
Gin Ser Gin Gin Gly Ala Arg Ala He Trp Arg Ala Leu Cys His Ala 
280 285 290 

TTT GGG AGA CGG CTG GTC CTC AGC AGC ACA TTC CGT ATC CTG GCC GAC 966 
Phe Gly Arg Arg Leu Val Leu Ser Ser Thr Phe Arg He Leu Ala Asp 
295 ' 300 305 310 

CTC CTG GGC TTT GCT GGG CCA CTC TGC ATC TTC GGG ATC GTG GAC CAC 1014 
Leu Leu Gly Phe Ala Gly Pro Leu Cys He Phe Gly He Val Asp His 
315 320 325 

CTC GGG AAG GAG AAC CAC GTC TTC CAG CCC AAG ACA CAG TTT CTT GGA 106 2 

Leu Gly Lys Glu Asn His Val Phe Gin Pro Lys Thr Gin Phe Leu Gly 
330 335 340 

GTT TAC TTT GTC TCA TCC CAA GAG TTC CTC GGC AAT GCC TAT GTC TTG 1110 
Val Tyr Phe Val Ser Ser Gin Glu Phe Leu Gly Asn Ala Tyr Val Leu 
345 350 355 

GCT GTT CTT CTG TTC CTT GCC CTC CTG CTG CAA AGG ACC TTT CTA CAA 1158 
Ala Val Leu Leu Phe Leu Ala Leu Leu Leu Gin Arg Thr Phe Leu Gin 

360 365 370 

GCC TCG TAC TAC GTT GCC ATT GAA ACT GGG ATC AAC CTG AGA GGA GCA 12 0 6 

Ala Ser Tyr Tyr Val Ala He Glu Thr Gly He Asn Leu Arg Gly Ala 
375 380 385 390 

ATC CAG ACC AAG ATT TAC AAT AAG ATC ATG CAC TTG TCT ACT TCC AAC 12 54 

He Gin Thr Lys He Tyr Asn Lys lie Met His Leu Ser Thr Ser Asn 
395 400 405 

CTG TCC ATG GGG GAA ATG ACT GCT GGG CAG ATC TGC AAC CTG GTG GCC 13 02 

Leu Ser Met Gly Glu Met Thr Ala Gly Gin He Cys Asn Leu Val Ala 
410 415 420 

ATC GAC ACC AAC CAG CTC ATG TGG TTT TTC TTC TTA TGC CCA AAC CTC 13 5 0 

He Asp Thr Asn Gin Leu Met Trp Phe Phe Phe Leu Cys Pro Asn Leu 
425 430 435 

TGG GCT ATG CCG GTA CAG ATC ATT GTG GGC GTG ATC CTC CTC TAC TAC 13 98 

Trp Ala Met Pro Val Gin He He Val Gly Val He Leu Leu Tyr Tyr 
440 445 450 

ATC CTT GGG GTC AGC GCC TTG ATT GGA GCG GCT GTC ATC ATT CTG CTG 144 6 

He Leu Gly Val Ser Ala Leu He Gly Ala Ala Val lie He Leu Leu 
455 460 465 470 

GCT CCT GTA CAG TAC TTT GTG GCC ACC AAG CTG TCC CAG GCA CAG CGG 14 94 

Ala Pro Val Gin Tyr Phe Val Ala Thr Lys Leu Ser Gin Ala Gin Arg 
475 480 485 
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ACG ACC CTG GAA TAT TCC AAT GAG AGG CTG AAG CAG ACC AAT GAG ATG 1542 

Thr Thr Leu Glu Tyr Ser Asn Glu Arg Leu Lys Gin Thr Asn Glu Met 
490 495 500 

CTC CGG GGC ATC AAG TTG CTC AAG CTC TAT GCG TGG GAG AAC ATC TTC 15 90 

Leu Arg Gly He Lys Leu Leu Lys Leu Tyr Ala Trp Glu Asn He Phe 
505 510 515 

TGC TCC AGG GTG GAG AAG ACA CGC AGG AAG GAA ATG ACC AGC CTC AGG 163 8 

Cys Ser Arg Val Glu Lys Thr Arg Arg Lys Glu Met Thr Ser Leu Arg 
520 525 530 

GCC TTC GCT GTC TAC ACC TCC ATC TCC ATC TTC ATG AAC ACA GCT ATC 1686 

Ala Phe Ala Val Tyr Thr Ser He Ser He Phe Met Asn Thr Ala He 
535 540 545 550 

CCC ATC GCT GCT GTC CTC ATC ACC TTC GTG GGC CAC GTC AGC TTC TTC 17 34 

Pro He Ala Ala Val Leu He Thr Phe Val Gly His Val Ser Phe Phe 
555 560 565 

AAA GAG TCG GAC TTC TCG CCC TCG GTG GCC TTT GCC TCT CTC TCT CTC 1782 

Lys Glu Ser Asp Phe Ser Pro Ser Val Ala Phe Ala Ser Leu Ser Leu 
570 575 580 

TTC CAC ATC CTG GTC ACA CCG CTG TTC CTG CTG TCT AGT GTG GTT CGG 18 3 0 

Phe His He Leu Val Thr Pro Leu Phe Leu Leu Ser Ser Val Val Arg 
585 590 595 

TCC ACT GTC AAG GCC CTG GTG AGC GTG CAA AAG CTG AGT GAG TTC CTG 18 78 

Ser Thr Val Lys Ala Leu Val Ser Val Gin Lys Leu Ser Glu Phe Leu 
600 605 610 

TCC AGT GCA GAG ATC CGT GAG GAA CAG TGT GCC CCC CGA GAG CCC GCA 1926 

Ser Ser Ala Glu He Arg Glu Glu Gin Cys Ala Pro Arg Glu Pro Ala 
615 620 625 630 

CCC CAA GGC CAA GCG GGC AAG TAC CAG GCG GTG CCC CTC AAG GTC GTA 1974 

Pro Gin Gly Gin Ala Gly Lys Tyr Gin Ala Val Pro Leu Lys Val Val 
635 640 645 

AAC CGC AAG CGC CCA GCC CGA GAA GAA GTC CGG GAC CTC TTG GGC CCA 2 022 

Asn Arg Lys Arg Pro Ala Arg Glu Glu Val Arg Asp Leu Leu Gly Pro 
650 655 660 

CTG CAG AGG CTG ACT CCC AGC ACG GAT GGA GAC GCT GAC AAC TTC TGT 20 70 

Leu Gin Arg Leu Thr Pro Ser Thr Asp Gly Asp Ala Asp Asn Phe Cys 
665 670 .675 

GTC CAG ATC ATC GGA GGC TTC TTC ACC TGG ACC CCT GAT GGA ATC CCC 2118 

Val Gin He He Gly Gly Phe Phe Thr Trp Thr Pro Asp Gly lie Pro 
680 685 690 

ACC CTG TCC AAC ATC ACC ATC CGT ATC CCC CGA GGT CAG CTG ACC ATG 2166 

Thr Leu Ser Asn He Thr He Arg He Pro Arg Gly Gin Leu Thr Met 
695 700 705 710 

ATC GTG GGG CAG GTG GGC TGT GGC AAG TCC TCG CTC CTT CTG GCC ACC 2214 

lie Val Gly Gin Val Gly Cys Gly Lys Ser Ser Leu Leu Leu Ala Thr 
715 720 725 

CTG GGG GAG ATG CAG AAG GTC TCT GGA GCT GTC TTC TGG AAC AGC CTT 2262 

Leu Gly Glu Met Gin Lys Val Ser Gly Ala Val Phe Trp Asn Ser Leu 
730 735 740 

CCA GAC AGC GAG GGG AGA AGA CCC CAG CAA CCC AGA GCG GGA GAC AGC 2310 

Pro Astd Ser Glu Gly Arg Arcr Pro Gin Gin Pro Arg Ala Gly Asp Ser 
745 ~ 750 755 
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GGC CGA TTC GGA TGC CAG GAG CAG AGG CCC TGT GGC TAC GCA TCT CAG 23 58 

Gly Arg Phe Gly Cys Gin Glu Gin Arg Pro Gys Gly Tyr Ala Ser Gin 
760 765 770 

AAA CCA TGG CTG CTA AAT GCC ACT GTG GAG GAG AAC ATC ACC TTC GAG 2406 
Lys Pro Trp Leu Leu Asn Ala Thr Val Glu Glu Asn lie Thr Phe Glu 
775 760 785 790 

AGT CCC TTC AAT AAG CAA CGG TAC AAG ATG GTC ATC GAA GCC TGC TCC 24 54 

Ser Pro Phe Asn Lys Gin Arg Tyr Lys Met Val lie Glu Ala Cys Ser 
795 800 805 

CTG CAG CCA GAC ATA GAC ATC CTG CCC CAT GGA GAC CAG ACT CAG ATT 25 02 

Leu Gin Pro Asp lie Asp lie Leu Pro His Gly Asp Gin Thr Gin lie 
810 815 820 

GGG GAA CGA GGC ATC AAC TTG AGT ACT GGT GGT CAG CGT CCA GAT CAG 2 55 0 

Gly Glu Arg Gly lie Asn Leu Ser Thr Gly Gly Gin Arg Pro Asp Gin 
825 830 835 

TGT AGA CCC GAG CCC TCT ACC AGC ACA CCA ATG ATT GTC TTT TTG GAT 25 98 

Cys Arg Pro Glu Pro Ser Thr Ser Thr Pro Met lie Val Phe Leu Asp 
840 845 850 

GAC CCT TTC TCG GCT CTG GAT GTC CAT CTG AGT GAC CAC CTA ATG CAG 264 6 

Asp Pro Phe Ser Ala Leu Asp Val His Leu Ser Asp His Leu Met Gin 
855 860 865 870 

GCT GGC ATC CTC GAG CTG CTC CGG GAT GAC AAG AGG ACA GTG GTC TTG 2 694 

Ala Gly lie Leu Glu Leu Leu Arg Asp Asp Lys Arg Thr Val Val Leu 
875 880 885 

GTG ACC CAC AAG CTA CAG TAC CTG CCT CAT GCT GAC TGG ATC ATT GCT 2 742 

Val Thr His Lys Leu Gin Tyr Leu Pro His Ala Asp Trp He He Ala 
890 895 900 

ATG AAG GAT GGC ACC ATT CAG AGG GAG GGG ACA CTC AAG GAC TTC CAG 2 7 90 

Met Lys Asp Gly Thr He Gin Arg Glu Gly Thr Leu Lys Asp Phe Gin 
905 910 915 

AGG TCT GAG TGC CAG CTC TTT GAG C£T TGG AAG ACC CTC ATG AAC CGG 2B3 8 

Arg Ser Glu Cys Gin Leu Phe Glu His Trp Lys Thr Leu Met Asn Arg 
920 925 930 

CAG GAC CAA GAG CTG GAG AAG GAG ACA GTC ATG GAG AGA AAA GCC CCA 2886 
Gin Asp Gin Glu Leu Glu Lys Glu Thr Val Met Glu Arg Lys Ala Pro 
935 940 945 950 

GAG CCA TCT CAG GGC CTG CCC CGT GCC ATG TCC TCA AGA GAT GGC CTT 2 934 

Glu Pro Ser Gin Gly Leu Pro Arg Ala Met Ser Ser Arg Asp Gly Leu 
955 960 965 

CTG CTG GAT GAG GAT GAG GAG GAA GAG GAG GCA GCC GAG AGC GAG GAA 2 982 

'Leu Leu Asp Glu Asp Glu Glu Glu Glu Glu Ala Ala Glu Ser Glu Glu 
970 975 980 

GAT GAC AAC TTA TCC TCT GTG CTG CAT CAG CGA GCC AAG ATC CCA TGG 3 03 0 

Asp Asp Asn Leu Ser Ser Val Leu His Gin Arg Ala Lys He Pro Trp 
985 990 995 

CGA GCC TGC ACC AAG TAT TTG TCC TCT GCT GGC ATC CTG CTC CTG TCC 3 07 8 

Arg Ala Cys Thr Lys Tyr Leu Ser Ser Ala Gly He Leu Leu Leu Ser 
1000 1005 1010 

CTG CTT GTC TTC TCC CAG CTG CTC AAG CAC ATG GTC TTG GTG GCC ATT 3126 
Leu Leu Val Phe Ser Gin Leu Leu Lys His Met Val Leu Val Ala lie 
1015 1020 1025 1030 
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GAC TAC TGG CTG GCC AAG TGG ACG GAC AGT GCC CTG GTC CTG AGC CCC 317 4 

Asp Tyr Trp Leu Ala Lys Trp Thr Asp Ser Ala Leu Val Leu Ser Pro 
1035 1040 1045 

GCC GCC AGG AAC TGC TCC CTC AGC CAG GAA TGT GCC CTG GAC CAA TCT 3 222 

Ala Ala Arg Asn Cys Ser Leu Ser Gin Glu Cys Ala Leu Asp Gin Ser 
1050 1055 1060 

GTC TAT GCC ATG GTA TTC ACC GTG CTC TGC AGC CTG GGT ATC GCG CTG 32 70 

Val Tyr Ala Met Val Phe Thr Val Leu Cys Ser Leu Gly He Ala Leu 
1065 1070 1075 

TGC CTT GTC ACC TCT GTC ACT GTG GAG TGG ACG GGA CTG AAG GTG GCC 3318 
Cys Leu Val Thr Ser Val Thr Val Glu Trp Thr Gly Leu Lys Val Ala 
1080 1085 1090 

AAG AGG CTG CAT CGC AGC CTG CTC AAC CGT ATC ATC CTG GCT CCC ATG 3 366 

Lys Arg Leu His Arg Ser Leu Leu Asn Arg He lie Leu Ala Pro Met 
1095 1100 1105 1110 

AGG TTC TTT GAG ACC ACG CCC CTG GGG AGT ATC CTG AAC AGA TTT TCA 3414 
Arg Phe Phe Glu Thr Thr Pro Leu Gly Ser He Leu Asn Arg Phe Ser 
1115 1120 1125 

TCT GAC TGT AAC ACC ATT GAC CAG CAT ATC CCG TCC ACG CTG GAG TGC 3462 
Ser Asp Cys Asn Thr He Asp Gin His He Pro Ser Thr Leu Glu Cys 
1130 1135 1140 

CTG AGC AGA TCC ACC TTA CTC TGT GTC TCC GCC CTG GCT GTC ATC TCC 3 510 

Leu Ser Arg Ser Thr Leu Leu Cys Val Ser Ala Leu Ala Val He Ser 
1145 1150 1155 

TAC GTC ACG CCT GTG TTC CTA GTG GCC CTC TTA CCC CTC GCC GTC GTG 3 5 58 

Tyr Val Thr Pro Val Phe Leu Val Ala Leu Leu Pro Leu Ala Val Val 
1160 1165 1170 

TGC TAC TTC ATC CAG AAG TAC TTC CGA GTG GCG TCC AGG GAC CTG CAG 3 606 

Cys Tyr Phe lie Gin Lys Tyr Phe Arg Val Ala Ser Arg Asp Leu Gin 
1175 1160 1185 1190 

CAG CTG GAC GAC ACA ACA CAG CTC CCT CTG CTC TCA CAC TTT GCT GAA 3654 
Gin Leu Asp Asp Thr Thr Gin Leu Pro Leu Leu Ser His Phe Ala Glu 
1195 1200 1205 

ACT GTG GAA GGA CTC ACC ACC ATC CGT GCC TTC AGG TAC GAG GCC CGG 3 702 

Thr Val Glu Gly Leu Thr Thr He Arg Ala Phe Arg Tyr Glu Ala Arg 
1210 1215 1220 

TTC CAG CAG AAG CTC CTA GAG TAC ACC GAC TCC AAC AAC ATT GCC TCT .3750 
Phe Gin Gin Lys Leu Leu Glu Tyr Thr Asp Ser Asn Asn He Ala Ser 
1225 1230 1235 

CTC TTC CTC ACA GCA GCC AAC AGG TGG CTG GAA GTC CGC ATG GAG TAC 3 7 98 

Leu Phe Leu Thr Ala Ala Asn Arg Trp Leu Glu Val Arg Met Glu Tyr 
1240 1245 1250 

ATC GGA GCA TGC GTG GTA CTC ATC GCC GCT GCC ACC TCC ATC TCC AAC 3 846 

He Gly Ala Cys Val Val Leu He Ala Ala Ala Thr Ser He Ser Asn 
1255 1260 1265 1270 

TCC CTA CAC AGG GAG CTC TCA GCC GGC CTA GTA GGC CTG GGC CTC ACC 3 8 94 

Ser Leu His Arg Glu Leu Ser Ala Gly Leu Val Gly Leu Gly Leu Thr 
1275 1280 1285 

TAT GCC TTG ATG ATT GGG ATC TGC GGC CGC ACA GGC AGT GGA AAA TCC 3 942 

Tyr Ala Leu Met He Gly He Cys Gly Arg Thr Gly Ser Gly Lys Ser 
1290 1295 1300 
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TCC TTC TCT CTC GCC TTT TTC CGA ATG GTG GAT ATG TTT GAA GGG CGT 3 9 90 

Ser Pile Ser Leu Ala Phe Phe Arg Met Val Asp Met Phe Glu Gly Arg 
1305 1310 1315 

ATC ATC ATC GAT GGC ATT GAC ATC GCC AAG CTG CCG CTG CAC ACG CTC 4 03 8 

lie lie He Asp Gly He Asp He Ala Lvs Leu Pro Leu His Thr Leu 
1320 1325 \ 1330 

CGC TCA CGC CTG TCT ATC ATC CTA CAG GAC CCT GTT CTC TTC AGT GGT 4 08 6 

Arg Ser Arg Leu Ser He lie Leu Gin Asp pro Val Leu Phe Ser Gly 
1335 1340 1345 1350 

ACC ATC AGA TTC AAC CTG GAC CCA GAG AAG AAA TGC TCA GAC AGC ACG 4134 
Thr He Arg Phe Asn Leu Asp Pro Glu Lys Lys Cys Ser Asp Ser Thr 
1355 1360 1365 

CTG TGG GAG GCT CTG GAG ATC GCT CAG CTG AAG CTG GTG GTG AAG GCC 4182 
Leu Trp Glu Ala Leu Glu He Ala Gin Leu Lvs Leu Val Val Lys Ala 
1370 1375 "* 1380 

CTG CCA GGA GGC CTG GAT GCC ATC ATC ACG GAA GGA GGG GAG AAT TTT 42 3 0 

Leu Pro Gly Gly Leu Asp Ala He lie Thr Glu Gly Gly Glu Asn Phe 
1385 1390 1395 

AGC CAG GGC CAG AGG CAG CTG TTC TGC CTG GCC CGG GCC TTT GTG AGG 4278 
Ser Gin Gly Gin Arg Gin Leu Phe Cys Leu Ala Arg Ala Phe Val Arg 
1400 1405 1410 

AAG ACC AGC ATC TTC ATC ATG GAT GAA GCA ACT GCC TCC ATC GAC ATG 43 26 

Lys Thr Ser lie Phe lie Met Asp Glu Ala Thr Ala Ser lie Asp Met 
1415 1420 1425 1430 

GCT ACG GAA AAT ATC CTC CAG AAG GTG GTG ATG ACA GCC TTC GCA GAC 43 74 

Ala Thr Glu Asn He Leu Gin Lys Val Val Met Thr Ala Phe Ala Asp 
1435 1440 1445 

CGC ACC GTG GTC ACC ATC GCG CAC CGC GTG CAC ACC ATC CTG AGT GCA 4422 
Arg Thr Val Val Thr He Ala His Arg Val His Thr He Leu Ser Ala 
1450 1455 1460 

GAC CTA GTG ATG GTC CTG AAG AGG GGC GCG ATC CTG GAG TTC GAC AAG 44 70 

Asp Leu Val Met Val Leu Lys Arg Gly Ala He Leu Glu Phe Asp Lys 
1465 1470 1475 

CCG GAA AAG CTT CTC AGC CAG AAG GAC AGC GTC TTT GCC TCC TTT GTC 4 518 

Pro Glu Lys Leu Leu Ser Gin Lys Asp Ser Val Phe Ala Ser Phe Val 
1480 1485 1490 

CGC GCG GAC AAA TG AC CAG CCA GCGCCAAAGT GCCACCCCAC ACCTCACCTG 4 5 70 

Arg Ala Asp Lys 

1495 

CTTGCCATGG ATTTCTTACT GTAAATCACT TGTAAATAAA GAAACTAATT CTTTG CTAAA 4 63 0 
AAAAA 4635 



(6) INFORMATION FOR SEQ ID NO : 5 : 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 98 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 5 ; 
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Met Pro Leu Ala Phe Cys Gly Thr Glu Asn His Ser Ala Ala Tyr Arg 
15 10 15 

Val Asp Gin Gly Val Leu Asn Asn Gly Cys Phe Val Asp Ala Leu Asn 
20 25 30 

Val Val Pro His Val Phe Leu Leu Phe lie, Thr Phe Pro He Leu Phe 
35 40 45 

He Gly Trp Gly Ser Gin Ser Ser Lys Val His He His His Ser Thr 
50 55 60 

Trp Leu His Phe Pro Gly His Asn Leu Arg Trp He Leu Thr Phe lie 
65 70 75 80 

Leu Leu Phe Val Leu Val Cys Glu lie Ala Glu Gly He Leu Ser Asp 
85 90 95 

Gly Val Thr Glu Ser Arg His Leu His Leu Tyr Met Pro Ala Gly Met 
100 105 110 

Ala Phe Met Ala Ala lie Thr Ser Val Val Tyr Tyr His Asn He Glu 
115 120 125 

Thr Ser Asn Phe Pro Lys Leu Leu He Ala Leu Leu lie Tyr Trp Thr 
130 135 140 

Leu Ala Phe He Thr Lys Thr lie Lys Phe Val Lys Phe Tyr Asp His 
145 150 155 160 

Ala He Gly Phe Ser Gin Leu Arg Phe Cys Leu Thr Gly Leu Leu Val 
165 170 175 

lie Leu Tyr Gly Met Leu Leu Leu Val Glu Val Asn Val He Arg Val 
180 185 190 

Arg Arg Tyr Val Phe Phe Lys Thr Pro Arg Glu Val Lys Pro Pro Glu 
195 200 205 

Asp Leu Gin Asp Leu Gly Val Arg Phe Leu Gin Pro Phe Val Asn Leu 
210 215 220 

Leu Ser Lys Gly Thr Tyr Trp Trp Met Asn Ala Phe He Lys Thr Ala 
225 230 235 240 

His Lys Lys Pro He Asp Leu Arg Ala lie Gly Lys Leu Pro lie Ala 
245 " 250 255 

Met Arg Ala Leu Thr Asn Tyr Gin Arg Leu Cys Leu Ala Phe Asp Ala 
260 265 270 

Gin Ala Arg Lys Asp Thr Gin Ser Gin Gin Gly Ala Arg Ala He Trp 
275 280 285 

Arg Ala Leu Cys His Ala Phe Gly Arg Arg Leu Val Leu Ser Ser Thr 
290 295 300 

Phe Arg He Leu Ala Asp Leu Leu Gly Phe Ala Gly Pro Leu Cys lie 
305 310 315 320 

Phe Gly lie Val Asp His Leu Gly Lys Glu Asn His Val Phe Gin Pro 
325 330 335 

Lys Thr Gin Phe Leu Gly Val Tyr Phe Val Ser Ser Gin Glu Phe Leu 
340 345 350 

Gly Asn Ala Tyr Val Leu Ala Val Leu Leu Phe Leu Ala Leu Leu Leu 
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355 360 365 

Gin Arg Thr Phe Leu Gin Ala Ser Tyr Tyr Val Ala lie Glu Thr Gly 
370 375 380 

lie Asn Leu Arg Gly Ala lie Gin Thr Lys lie Tyr Asn Lys He Met 
385 390 395 400 

His Leu Ser Thr Ser Asn Leu Ser Met Gly Glu Met Thr Ala Gly Gin 
405 410 415 

lie Cys Asn Leu Val Ala lie Asp Thr Asn Gin Leu Met Trp Phe Phe 
420 425 430 

Phe Leu Cys Pro Asn Leu Trp Ala Met Pro Val Gin He He Val Gly 
435 440 445 

Val He Leu Leu Tyr Tyr He Leu Gly Val Ser Ala Leu He Gly Ala 
450 455 460 

Ala Val lie He Leu Leu Ala Pro Val Gin Tyr Phe Val Ala Thr Lys 
465 470 475 480 

Leu Ser Gin Ala Gin Arg Thr Thr Leu Glu Tyr Ser Asn Glu Arg Leu 
485 490 495 

Lys Gin Thr Asn Glu Met Leu Arg Gly He Lys Leu Leu Lys Leu Tyr 
500 505 510 

Ala Trp Glu Asn lie Phe Cys Ser Arg Val Glu Lys Thr Arg Arg Lys 
515 520 525 

Glu Met Thr Ser Leu Arg Ala Phe Ala Val Tyr Thr Ser lie Ser He 
530 535 S40 

Phe Met Asn Thr Ala He Pro He Ala Ala Val Leu lie Thr Phe Val 
545 550 555 560 

Gly His Val Ser Phe Phe Lys Glu Ser Asp Phe Ser Pro Ser Val Ala 
565 570 575 

Phe Ala Ser Leu Ser Leu Phe His He Leu Val Thr Pro Leu Phe Leu 
580 585 590 

Leu Ser Ser Val Val Arg Ser Thr Val Lys Ala Leu Val Ser Val Gin 
595 600 605 

Lys Leu Ser Glu Phe Leu Ser Ser Ala Glu lie Arg Glu Glu Gin Cys 
610 615 620 

Ala Pro Arg Glu Pro Ala Pro Gin Gly Gin Ala Gly Lys Tyr Gin Ala 
625 630 635 640 

Val Pro Leu Lys Val Val Asn Arg Lys Arg Pro Ala Arg Glu Glu Val 
645 650 655 

Arg Asp Leu Leu Gly Pro Leu Gin Arg Leu Thr Pro Ser Thr Asp Gly 
660 " 665 670 

Asp Ala Asp Asn Phe Cys Val Gin He He Gly Gly Phe Phe Thr Trp 
675 680 685 

Thr Pro Asp Gly He Pro Thr Leu Ser Asn He Thr He Arg He Pro 
690 695 700 

Arg Gly Gin Leu Thr Met lie Val Gly Gin Val Gly Cys Gly Lys Ser 
705 710 715 720 
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Ser Leu Leu Leu Ala Thr Leu Gly Glu Met Gin Lys Val Ser Gly Ala 
725 730 735 

Val Phe Trp Asn Ser Leu Pro Asp Ser Glu Gly Arg Arg Pro Gin Gin 
740 745 750 

Pro Arg Ala Gly Asp Ser Gly Arg Phe Gly Cys Gin Glu Gin Arg Pro 
755 760 765 

Cys Gly Tyr Ala Ser Gin Lys Pro Trp Leu Leu Asn Ala Thr Val Glu 
770 775 780 

Glu Asn lie Thr Phe Glu Ser Pro Phe Asn Lys Gin Arg Tyr Lys Met 
785 790 795 800 

Val lie Glu Ala Cys Ser Leu Gin Pro Asp lie Asp lie Leu Pro His 
805 810 815 

Gly Asp Gin Thr Gin He Gly Glu Arg Gly He Asn Leu Ser Thr Gly 
820 825 830 

Gly Gin Arg Pro Asp Gin Cys Arg Pro Glu Pro Ser Thr Ser Thr Pro 
835 840 845 

Met He Val Phe Leu Asp Asp Pro Phe Ser Ala Leu Asp Val His Leu 
850 855 860 

Ser Asp His Leu Met Gin Ala Gly lie Leu Glu Leu Leu Arg Asp Asp 
865 870 875 880 

Lys Arg Thr Val Val Leu Val Thr His Lys Leu Gin Tyr Leu Pro His 
885 890 895 

Ala Asp Trp He He Ala Met Lys Asp Gly Thr He Gin Arg Glu Gly 
900 905 910 

Thr Leu Lys Asp Phe Gin Arg Ser Glu Cys Gin Leu Phe Glu His Trp 
915 920 925 

Lys Thr Leu Met Asn Arg Gin Asp Gin Glu Leu Glu Lys Glu Thr Val 
930 935 940 

Met Glu Arg Lys Ala Pro Glu Pro Ser Gin Gly Leu Pro Arg Ala Met 
945 950 955 960 

Ser Ser Arg Asp Gly Leu Leu Leu Asp Glu Asp Glu Glu Glu Glu Glu 
965 970 975 

Ala Ala Glu Ser Glu Glu Asp Asp Asn Leu Ser Ser Val Leu His Gin 
980 985 990 

Arg Ala Lys He Pro Trp Arg Ala Cys Thr Lys Tyr Leu Ser Ser Ala 
995 1000 ' 1005 

Gly lie Leu Leu Leu Ser Leu Leu Val Phe Ser Gin Leu Leu Lys His 
1010 1015 1020 

Met Val Leu Val Ala lie Asp Tyr Trp Leu Ala Lys Trp Thr Asp Ser 
1025 1030 1035 1040 

Ala Leu Val Leu Ser Pro Ala Ala Arg Asn Cys Ser Leu Ser Gin Glu 
1045 1050 1055 

Cys Ala Leu Asp Gin Ser Val Tyr Ala Met Val Phe Thr Val Leu Cys 
1060 1065 1070 

Ser Leu Gly He Ala Leu Cys Leu Val Thr Ser Val Thr Val Glu Trp 
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Thr Gly Leu Lys Val Ala Lys Arg Leu His Arg Ser Leu Leu Asn Arg 
1090 1095 1100 

lie lie Leu Ala Pro Met Arg Phe Phe Glu Thr Thr Pro Leu Gly Ser 
1105 1110 1115 1120 

lie Leu Asn Arg Phe Ser Ser Asp Cys Asn Thr He Asp Gin His He 
1125 1130 1135 

Pro Ser Thr Leu Glu Cys Leu Ser Arg Ser Thr Leu Leu Cys Val Ser 
1140 1145 1150 

Ala Leu Ala Val He Ser Tyr Val Thr Pro Val Phe Leu Val Ala Leu 
1155 1160 1165 

Leu Pro Leu Ala Val Val Cys Tyr Phe He Gin Lys Tyr Phe Arg Val 
1170 1175 1180 

Ala Ser Arg Asp Leu Gin Gin Leu Asp Asp Thr Thr Gin Leu Pro Leu 
1185 1190 1195 1200 

Leu Ser His Phe Ala Glu Thr Val Glu Gly Leu Thr Thr He Arg Ala 
1205 1210 1215 

Phe Arg Tyr Glu Ala Arg Phe Gin Gin Lys Leu Leu Glu Tyr Thr Asp 
1220 1225 1230 

Ser Asn Asn He Ala Ser Leu Phe Leu Thr Ala Ala Asn Arg Trp Leu 
1235 1240 1245 

Glu Val Arg Met Glu Tyr He Gly Ala Cys Val Val Leu He Ala Ala 
1250 1255 1260 

Ala Thr Ser He Ser Asn Ser Leu His Arg Glu Leu Ser Ala Gly Leu 
1265 1270 1275 1280 

Val Gly Leu Gly Leu Thr Tyr Ala Leu Net He Gly He Cys Gly Arg 
1285 1290 1295 

Thr Gly Ser Gly Lys Ser Ser Phe Ser Leu Ala Phe Phe Arg Met Val 
1300 1305 1310 

Asp Met Phe Glu Gly Arg He He He Asp Gly He Asp He Ala Lys 
1315 1320 1325 

Leu Pro Leu His Thr Leu Arg Ser Arg Leu Ser He He Leu Gin Asp 
1330 1335 1340 

Pro Val Leu Phe Ser Gly Thr He Arg Phe Asn Leu Asp Pro Glu Lys 
1345 1350 1355 1360 

'Lys Cys Ser Asp Ser Thr Leu Trp Glu Ala Leu Glu He Ala Gin Leu 
1365 1370 1375 

Lys Leu Val Val Lys Ala Leu Pro Gly Gly Leu Asp Ala lie He Thr 
1380 1385 * 1390 

Glu Gly Gly Glu Asn Phe Ser Gin Gly Gin Arg Gin Leu Phe Cys Leu 
1395 1400 * 1405 

Ala Arg Ala Phe Val Arg Lys Thr Ser lie Phe He Met Asp Glu Ala 
1410 1415 1420 

Thr Ala Ser lie Asp Met Ala Thr Glu Asn He Leu Gin Lys Val Val 
1425 1430 1435 1440 
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Met Thr Ala Phe Ala Asp Arg Thr Val Val Thr lie Ala His Arg Val 

1445 1450 1455 

His Thr lie Leu Ser Ala Asp Leu Val Met Val Leu Lys Arg Gly Ala 

1460 " 1465 1470 

lie Leu Glu Phe Asp Lys Pro Glu Lys Leu Leu Ser Gin Lys Asp Ser 

1475 1480 1485 

Val Phe Ala Ser Phe Val Arg Ala Asp Lys 

1490 1495 



(7) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4625 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE ; cDNA 
<iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(ix) FEATURE: 

(A} NAME /KEY ; CDS 

(B) LOCATION; 25.. 4521 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

AGCCGAGCCC GTGCGCGCGC CGCC ATG CCC TTG GCC TTC TGC GGT ACC GAG 51 

Met Pro Leu Ala Phe Cys Gly Thr Glu 
1 5 

AAC CAC TCG GCC GCC TAC CGG GTG GAC CAG GGC GTC CTC AAC AAC GGC 99 

Asn His Ser Ala Ala Tyr Arg Val Asp Gin Gly Val Leu Asn Asn Gly 

10 15 20 25 

TGC TTC GTG GAC GCG CTC AAC GTG GTG CCG CAC GTT TTC CTG CTC TTC 147 

Cys Phe Val Asp Ala Leu Asn Val Val Pro His Val Phe Leu Leu Phe 

30 35 40 

ATC ACC TTC CCC ATC CTC TTC ATC GGA TGG GGC AGC CAG AGC TCC AAG 195 

lie Thr Phe Pro lie Leu Phe lie Gly Trp Gly Ser Gin Ser Ser Lys 

45 50 55 

GTG CAC ATC CAC CAC AGC ACC TGG CTG CAC TTT CCA GGG CAC AAC CTG 243 

Val His He His His Ser Thr Trp Leu His Phe Pro Gly His Asn Leu 
60 65 70 

CGC TGG ATC CTT ACC TTC ATT TTG CTC TTC GTC CTT GTG TGT GAG ATC 2 91 

Arg Trp He Leu Thr Phe He Leu Leu Phe Val Leu Val Cys Glu He 
75 80 85 

GCT GAG GGC ATC CTG TCT GAT GGG GTG ACA GAA TCC CGC CAC CTC CAC 33 9 

Ala Glu Gly He Leu Ser Asp Gly Val Thr Glu Ser Arg His Leu His 

90 95 100 105 

CTG TAC ATG CCA GCC GGG ATG GCG TTC ATG GCT GCC ATC ACC TCT GTA 387 

Leu Tyr Met Pro Ala Gly Met Ala Phe Met Ala Ala He Thr Ser Val 

110 115 120 
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GTC TAC TAT CAT AAC ATC GAG ACC TCC AAC TTC CCC AAG CTT TTG ATC 4 35 

Val Tyr Tyr His Asn lie Glu Thr Ser Asn Phe Pro Lys Leu Leu lie 
125 130 135 

GCT CTG CTC ATC TAT TGG ACC CTG GCC TTC ATC ACG AAG ACC ATC AAG 4 83 

Ala Leu Leu He Tyr Trp Thr Leu Ala Phe He Thr Lys Thr He Lys 
140 145 150 

TTT GTC AAG TTC TAT GAC CAC GCC ATC GGC TTC TCC CAG CTG CGC TTC 531 
Phe Val Lys Phe Tyr Asp His Ala He Gly Phe Ser Gin Leu Arg Phe 
155 160 165 

TGC CTC ACG GGG CTT CTG GTG ATC CTG TAT GGG ATG TTG CTG CTT GTG 57 9 

Cys Leu Thr Gly Leu Leu Val He Leu Tyr Gly Met Leu Leu Leu Val 
170 175 180 185 

GAG GTC AAC GTC ATC AGA GTG AGG AGG TAC ATC TTC TTC AAG ACG CCA 627 
Glu Val Asn Val He Arg Val Arg Arg Tyr He Phe Phe Lys Thr Pro 
190 195 200 

CGG GAG GTG AAG CCC CCT GAG GAC CTG CAG GAC CTG GGT GTG CGC TTT 6 75 

Arg Glu Val Lys Pro Pro Glu Asp Leu Gin Asp Leu Gly Val Arg Phe 
205 210 215 

CTG CAG CCC TTC GTT AAC CTG CTG TCA AAG GGG ACC TAT TGG TGG ATG 723 
Leu Gin Pro Phe Val Asn Leu Leu Ser Lys Gly Thr Tyr Trp Trp Met 
220 225 230 

AAT GCC TTC ATC AAG ACG GCC CAC AAG AAG CCC ATC GAC CTG CGG GCC 771 
Asn Ala Phe He Lys Thr Ala His Lys Lys Pro He Asp Leu Arg Ala 
235 240 245 

ATC GCG AAG CTG CCC ATC GCC ATG AGA GCC CTC ACC AAC TAT CAG CGC 819 
He Ala Lys Leu Pro lie Ala Met Arg Ala Leu Thr Asn Tyr Gin Arg 
250 255 260 265 

CTC TGC GTG GCC TTC GAT GCT CAG GCG CGG AAG GAC ACA CAG AGC CCA 867 
Leu Cys Val Ala Phe Asp Ala Gin Ala Arg Lys Asp Thr Gin Ser Pro 
270 275 280 

CAG GGT GCC CGG GCC ATC TGG AGG GCT CTA TGC CAT GCC TTT GGG AGA 915 
Gin Gly Ala Arg Ala He Trp Arg Ala Leu Cys His Ala Phe Gly Arg 
285 290 295 

CGC CTG ATC CTC AGC AGC ACA TTC CGC ATC CTG GCT GAC CTG TTG GGC 963 
Arg Leu He Leu Ser Ser Thr Phe Arg He Leu Ala Asp Leu Leu Gly 
300 305 310 

TTC GCT GGA CCA CTC TGC ATC TTT GGG ATC GTG GAC CAC CTG GGG AAG 1011 
Phe Ala Gly Pro Leu Cys He Phe Gly He Val Asp His Leu Gly Lys 
315 320 325 

GAG AAC CAC GTC TTC CAG CCC AAG ACA CAG TTT CTC GGG GTT TAC TTC 105 9 

'Glu Asn His Val Phe Gin Pro Lys Thr Gin Phe Leu Gly Val Tyr Phe 
330 335 340 345 

GTC TCT TCT CAA GAG TTC CTT GGC AAT GCC TAC GTC TTG GCC GTG CTT 1107 
Val Ser Ser Gin Glu Phe Leu Gly Asn Ala Tyr Val Leu Ala Val Leu 
350 355 360 

CTG TTC CTT GCC CTG CTA CTG CAA AGG ACA TTC CTG CAA GCC TCA TAC 115 5 

Leu Phe Leu Ala Leu Leu Leu Gin Arg Thr Phe Leu Gin Ala Ser Tyr 
365 370 375 

TAC GTC GCC ATT GAA ACT GGA ATT AAC CTG AGA GGA GCA ATC CAG ACC 12 03 

Tyr Val Ala He Glu Thr Gly He Asn Leu Arg Gly Ala He Gin Thr 
380 385 390 
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AAG ATT TAC AAT AAA ATC ATG CAC ATG TCC ACC TCC AAC CTG TCA ATG 1251 
Lys lie Tyr Asn Lys lie Met His Met Ser Thr Ser Asn Leu Ser Met 
395 400 405 

GGG GAA ATG ACT GCT GGG CAG ATC TGC AAC CTG GTG GCC ATC GAC ACA 12 99 

Gly Glu Met Thr Ala Gly Gin lie Cys Asn Leu Val Ala lie Asp Thr 
410 415 420 425 

AAC CAG CTC ATG TGG TTC TTC TTT CTG TGC CCA AAC CTC TGG ACG ATG 1347 
Asn Gin Leu Met Trp Phe Phe Phe Leu Cys Pro Asn Leu Trp Thr Met 
430 435 440 

CCA GTA CAG ATC ATT GTG GGC GTG ATC CTT CTC TAC TAC ATC CTT GGG 13 95 

Pro Val Gin lie He Val Gly Val He Leu Leu Tyr Tyr He Leu Gly 
445 450 455 

GTC AGT GCC TTG ATT GGA GCA GCT GTC ATC ATT CTG CTG GCT CCT GTA 1443 
Val Ser Ala Leu He Gly Ala Ala Val He lie Leu Leu Ala Pro Val 
460 465 470 

CAG TAC TTT GTG GCC ACC AAG CTC TCC CAG GCA CAG CGG ACG ACC TTG 14 91 

Gin Tyr Phe Val Ala Thr Lys Leu Ser Gin Ala Gin Arg Thr Thr Leu 
475 480 485 

GAG CAC TCC AAC GAG AGG CTG AAG CAG ACC AAC GAG ATG CTC CGG GGC 15 3 9 

Glu His Ser Asn Glu Arg Leu Lys Gin Thr Asn Glu Met Leu Arg Gly 
490 495 500 505 

ATG AAG CTG CTC AAA CTG TAT GCG TGG GAG AGC ATC TTC TGC TCC AGG 158 7 

Met Lys Leu Leu Lys Leu Tyr Ala Trp Glu Ser lie Phe Cys Ser Arg 
510 515 520 

GTG GAG GTG ACT CGC AGG AAG GAG ATG ACC AGC CTG AGG GCG TTT GCT 16 3 5 

Val Glu Val Thr Arg Arg Lys Glu Met Thr Ser Leu Arg Ala Phe Ala 
525 530 535 

GTC TAC ACT TCC ATC TCC ATC TTC ATG AAC ACA GCC ATC CCC ATT GCT 168 3 

Val Tyr Thr Ser He Ser He Phe Met Asn Thr Ala lie Pro lie Ala 
540 545 550 

GCC GTC CTC ATC ACC TTC GTG GGC CAC GTC AGC TTC TTC AAA GAG TCG 1731 
Ala Val Leu He Thr Phe Val Gly His Val Ser Phe Phe Lys Glu Ser 
555 560 565 

GAC TTG TCA CCC TCG GTG GCC TTT GCC TCC CTC TCT CTC TTC CAC ATC 177 9 

Asp Leu Ser Pro Ser Val Ala Phe Ala Ser Leu Ser Leu Phe His He 
570 575 580 585 

CTG GTC ACT CCA CTG TTC CTG CTG TCT AGC GTG GTT CGG TCC ACT GTC 1827 
Leu Val Thr Pro Leu Phe Leu Leu Ser Ser Val Val Arg Ser Thr Val 
590 595 600 

AAA GCC CTG GTG AGC GTG CAA AAA CTG AGC GAG TTC CTG TCT AGT GCA 187 5 

Lys Ala Leu Val Ser Val Gin Lys Leu Ser Glu Phe Leu Ser Ser Ala 
605 610 615 

GAG ATC CGT GAG GAG CAG TGT GCC CCC CGA GAG CCT GCA CCC CAA GGC 192 3 

Glu He Arg Glu Glu Gin Cys Ala Pro Arg Glu Pro Ala Pro Gin Gly 
620 625 630 

CAA GCC GGC AAG TAC CAG GCA GTG CCC CTC AAG GTT GTG AAC CGC AAA 1971 
Gin Ala Gly Lys Tyr Gin Ala Val Pro Leu Lys Val Val Asn Arg Lys 
635 " 640 645 

CGC CCA GCC CGG GAA GAG GTC CGG GAC CTC CTG GGC CCA CTG CAG AGG 2019 
Arg Pro Ala Arg Glu Glu Val Arg Asp Leu Leu Gly Pro Leu Gin Arg 
650 655 660 665 
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CTG GCC CCT AGC ATG GAC GGG GAT GCT GAC AAC TTC TGT GTC CAG ATC 2 06 7 

Leu Ala Pro Ser Met Asp Gly Asp Ala Asp Asn Phe Cys Val Gin lie 

670 675 680 

ATC GGA GGC TTC TTC ACC TGG ACC CCT GAT GGA ATC CCC ACT CTG TCC 2115 
lie Gly Gly Phe Phe Thr Trp Thr Pro Asp Gly He Pro Thr Leu Ser 
665 690 695 

AAC ATC ACC ATC CGT ATT CCC CGA GGT CAG CTA ACC ATG ATT GTG GGG 2163 

Asn He Thr He Arg He Pro Arg Gly Gin Leu Thr Met lie Val Gly 
700 705 710 

CAG GTG GGC TGC GGC AAG TCC TCG CTC CTC CTC GCC ACC CTG GGG GAG 2211 

Gin Val Gly Cys Gly Lys Ser Ser Leu Leu Leu Ala Thr Leu Gly Glu 
715 720 725 

ATG CAG AAG GTG TCG GGG GCC GTC TTC TGG AAC AGC AAC CTT CCG GAC 22 5 9 

Met Gin Lys Val Ser Gly Ala Val Phe Trp Asn Ser Asn Leu Pro Asp 
730 735 740 745 

AGC GAG GGG AGA GGA CCC CAG CAG CCC AGA GCG GGA GAC AGC AGC TGG 230 7 

Ser Glu Gly Arg Gly Pro Gin Gin Pro Arg Ala Gly Asp Ser Ser Trp 

750 755 760 

CTC GGA TAT CAG GAG CAG AGG CCC CGT GGC TAC GCA TCT CAG AAA CCA 2355 

Leu Gly Tyr Gin Glu Gin Arg Pro Arg Gly Tyr Ala Ser Gin Lys Pro 
765 770 775 

TGG CTG CTA AAC GCC ACC GTG GAA GAG AAC ATC ACC TTC GAG AGT CCC 24 0 3 

Trp Leu Leu Asn Ala Thr Val Glu Glu Asn He Thr Phe Glu Ser Pro 
780 785 790 

TTC AAT CCG CAG CGG TAC AAG ATG GTC ATC GAA GCC TGC TCC CTG CAG 2451 

Phe Asn Pro Gin Arg Tyr Lys Met Val He Glu Ala Cys Ser Leu Gin 
795 800 805 

CCG GAC ATA GAC ATC CTG CCC CAC GGA GAC CAG ACT CAG ATT GGG GAA 24 99 

Pro Asp He Asp lie Leu Pro His Gly Asp Gin Thr Gin He Gly Glu 
810 815 820 825 

CGG GGC ATC AAC CTG TCT GGT GGT CAG CGT CCA GAT CAG TGT GGT CCA 2547 

Arg Gly He Asn Leu Ser Gly Gly Gin Arg Pro Asp Gin Cys Gly Pro 

830 835 840 

GAG CCC TCT ACC AGC AGA CCA ATG TTC GTC TTC TTG GAT GAC CCC TTC 2 5 95 

Glu Pro Ser Thr Ser Arg Pro Met Phe Val Phe Leu Asp Asp Pro Phe 
845 850 855 

TCA GCT TTG GAT GTC CAT CTG AGT GAC CAC CTG ATG CAG GCC GGC ATC 2643 

Ser Ala Leu Asp Val His Leu Ser Asp His Leu Met Gin Ala Gly He 
860 865 870 

CTT GAG CTG CTC CGG GAT GAC AAG AGG AC A GTG GTC TTG GTG ACC CAC 26 91 

'Leu Glu Leu Leu Arg Asp Asp Lys Arg Thr Val Val Leu Val Thr His 
875 880 685 

AAG CTA CAG TAT CTG CCT CAT GCA GAC TGG ATC ATT GCC ATG AAG GAT 2 73 9 

Lys Leu Gin Tyr Leu Pro His Ala Asp Trp He He Ala Met Lys Asp 
890 895 900 905 

GGG ACC ATT CAG AGG GAA GGG ACG CTC AAG GAC TTC CAG AGG TCC GAG 278 7 

Glv Thr He Gin Arg Glu Gly Thr Leu Lys Asp Phe Gin Arg Ser Glu 

910 915 920 

TGC CAG CTC TTT GAG CAC TGG AAG ACC CTC ATG AAC CGG CAG GAC CAA 283 5 

Cys Gin Leu Phe Glu His Trp Lys Thr Leu Met Asn Arg Gin Asp Gin 
925 930 935 
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GAG CTG GAG AAG GAG ACA GTC ATG GAG AGG AAA GCC TCA GAG CCA TCT 28 83 

Glu Leu Glu Lys Glu Thr Val Met Glu Arg Lys Ala Ser Glu Pro Ser 
940 945 950 

CAG GGC CTG CCC CGT GCC ATG TCC TCC AGA GAC GGC CTT CTG CTG GAT 2931 
Gin Gly Leu Pro Arg Ala Met Ser Ser Arg Asp Gly Leu Leu Leu Asp 
955 960 965 

GAG GAA GAG GAG GAA GAG GAG GCA GCC GAA AGC GAG GAA GAT GAC AAC 2979 
Glu Glu Glu Glu Glu Glu Glu Ala Ala Glu Ser Glu Glu Asp Asp Asn 
970 975 980 985 

TTA TCT TCA GTG CTG CAT CAG CGA GCT AAG ATC CCC TGG CGA GCC TGC 3 02 7 

Leu Ser Ser Val Leu His Gin Arg Ala Lys lie Pro Trp Arg Ala Cys 
990 995 1000 

ACT AAG TAT CTG TCC TCT GCT GGC ATT CTG CTC CTG TCC CTG CTT GTC 3 0 75 

Thr Lys Tyr Leu Ser Ser Ala Gly lie Leu Leu Leu Ser Leu Leu Val 
1005 1010 1015 

TTC TCC CAG CTG CTC AAG CAC ATG GTC TTG GTG GCC ATT GAT TAT TGG 312 3 

Phe Ser Gin Leu Leu Lys His Met Val Leu Val Ala lie Asp Tyr Trp 
1020 1025 1030 

CTG GCC AAG TGG ACG GAC AGT GCC CTG GTC CTG AGC CCC GCT GCC AGG 3171 
Leu Ala Lys Trp Thr Asp Ser Ala Leu Val Leu Ser Pro Ala Ala Arg 
1035 1040 1045 

AAC TGT TCG CTC AGC CAG GAA TGT GAC CTG GAC CAG TCT GTC TAT GCC 3 219 

Asn Cys Ser Leu Ser Gin Glu Cys Asp Leu Asp Gin Ser Val Tyr Ala 
1050 1055 1060 1065 

ATG GTA TTC ACC TTG CTC TGC AGC CTG GGT ATC GTG CTG TGC CTG GTC 3267 
Met Val Phe Thr Leu Leu Cys Ser Leu Gly lie Val Leu Cys Leu Val 
1070 1075 1080 

ACC TCT GTC ACT GTG GAG TGG ACG GGA CTG AAG GTG GCC AAG AGG CTA 3 315 

Thr Ser Val Thr Val Glu Trp Thr Gly Leu Lys Val Ala Lys Arg Leu 
10S5 1090 1095 

CAC CGC AGC CTG CTC AAC CGC ATC ATC CTG GCC CCC ATG AGG TTC TTT 3 3 63 

His Arg Ser Leu Leu Asn Arg He He Leu Ala Pro Met Arg Phe Phe 
1100 1105 1110 

GAG ACC ACA CCC CTC GGG AGT ATC CTG AAC AGA TTT TCA TCC GAC TGT 3411 
Glu Thr Thr Pro Leu Gly Ser He Leu Asn Arg Phe Ser Ser Asp Cys 
1115 1120 1125 

AAC ACC ATT GAC CAG CAC ATC CCA TCC ACG CTG GAG TGT CTG AGC CGG 34 59 

Asn Thr He Asp Gin His He Pro Ser Thr Leu Glu Cys Leu Ser Arg 
1130 1135 1140 1145 

TCC ACC CTG CTG TGT GTC TCC GCC CTG ACT GTC ATC TCC TAT GTC ACA 3 507 

Ser Thr Leu Leu Cys Val Ser Ala Leu Thr Val He Ser Tyr Val Thr 
1150 1155 1160 

CCC GTG TTC CTC GTG GCC CTC TTA CCC CTA GCT GTT GTG TGC TAC TTC 3 5 55 

Pro Val Phe Leu Val Ala Leu Leu Pro Leu Ala Val Val Cys Tyr Phe 
1165 1170 1175 

ATT CAG AAG TAC TTC CGA GTG GCA TCC AGG GAC CTG CAG CAG CTG GAC 3 6 03 

He Gin Lys Tyr Phe Arg Val Ala Ser Arg Asp Leu Gin Gin Leu Asp 
1180 1185 " 1190 

GAC ACG ACG CAG CTC CCG CTC GTC TCA CAC TTT GCT GAA ACT GTG GAG 3 6 51 

Asp Thr Thr Gin Leu Pro Leu Val Ser His Phe Ala Glu Thr Val Glu 
1195 1200 1205 
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GGA CTC ACC ACC ATC CGT GCC TTC AGG TAC GAG GCC CGG TTC CAG CAG 36 9 9 

Gly Leu Thr Thr lie Arg Ala Phe Arg Tyr Glu Ala Arg Phe Gin Gin 
1210 1215 1220 1225 

AAG CTT CTA GAA TAT ACC GAC TCC AAC AAC ATC GCC TCC CTC TTC CTC 3 74 7 

Lys Leu Leu Glu Tyr Thr Asp Ser Asn Asn He Ala Ser Leu Phe Leu 
1230 1235 1240 

ACG GCA GCC AAC AGA TGG CTG GAA GTC TGC ATG GAG TAC ATC GGA GCG 37 95 

Thr Ala Ala Asa Arg Trp Leu Glu Val Cys Met Glu Tyr He Gly Ala 
1245 1250 1255 

TGC GTG GTA CTC ATT GCG GCT GCC ACC TCC ATC TCC AAC TCC CTG CAC 3 84 3 

Cys Val Val Leu He Ala Ala Ala Thr Ser He Ser Asn Ser Leu His 
1260 1265 1270 

AGG GAA CTT TCT GCT GGC CTG GTG GGC CTG GGC CTC ACC TAT GCC TTG 3 8 91 

Arg Glu Leu Ser Ala Gly Leu Val Gly Leu Gly Leu Thr Tyr Ala Leu 
1275 1280 1285 

ATG ATC GGG ATC TGC GGC CGC ACA GCG AGC GGG AAG TCC TCC TTC TCC 393 9 

Met He Gly He Cys Gly Arg Thr Ala Ser Gly Lys Ser Ser Phe Ser 
1290 1295 1300 1305 

CTG GCC TTT TTC CGA ATG GTG GAC ATG TTT GAA GGA CGC ATC ATC ATT 398 7 

Leu Ala Phe Phe Arg Met Val Asp Met Phe Glu Gly Arg He He He 
1310 1315 1320 

GAT GGC ATC GAC ATC GCC AAG CTG CCA CTT CAC ACG CTG CGC TCA CGC 4035 
Asp Gly lie Asp He Ala Lys Leu Pro Leu His Thr Leu Arg Ser Arg 
1325 1330 1335 

CTG TCC ATC ATC CTA CAG GAC CCC GTC CTC TTC AGC GGC ACG ATC AGA 4 083 

Leu Ser He lie Leu Gin Asp Pro Val Leu Phe Ser Gly Thr He Arg 
1340 1345 1350 

TTC AAC CTG GAC CCC GAG AAG AAA TGC TCA GAC AGC ACA CTG TGG GAG 4131 
Phe Asn Leu Asp Pro Glu Lys Lys Cys Ser Asp Ser Thr Leu Trp Glu 
1355 1360 1365 

GCC CTG GAG ATC GCC CAG CTG AAG CTG GTA GTG AAG GCA CTG CCA GGA 417 9 

Ala Leu Glu He Ala Gin Leu Lys Leu Val Val Lys Ala Leu Pro Gly 
1370 1375 1380 1385 

GGC CTA GAT GCC ATC ATC ACA GAA GGA GGG GAG AAT TTT AGC CAG GGC 422 7 

Gly Leu Asp Ala lie He Thr Glu Gly Gly Glu Asn Phe Ser Gin Gly 
1390 1395 1400 

CAG AGG CAG CTG TTC TGC CTG GCC CGG GCC TTC GTG AGG AAG ACC AGC 42 7 5 

Gin Arg Gin Leu Phe Cys Leu Ala Arg Ala Phe Val Arg Lys Thr Ser 
1405 1410 1415 

ATC TTC ATC ATG GAT GAA GCA ACC GCC TCC ATC GAC ATG GCT ACG GAG 4323 
'He Phe He Met Asp Glu Ala Thr Ala Ser He Asp Met Ala Thr Glu 
1420 1425 1430 

AAC ATC CTC CAG AAG GTG GTG ATG ACA GCC TTC GCA GAC CGC ACG GTG 4 3 71 

Asn He Leu Gin Lys Val Val Met Thr Ala Phe Ala Asp Arg Thr Val 
1435 1440 1445 

GTC ACC ATC GCG CAT CGT GTG CAC ACC ATC CTG AGT GCA GAC CTG GTG 4 419 

Val Thr He Ala His Arg Val His Thr He Leu Ser Ala Asp Leu Val 
1450 1455 1460 1465 

ATG GTC CTC AAG AGG GGT GCT ATC CTG GAG TTT GAC AAG CCA GAG ACG 44 6 7 

Met Val Leu Lys Arg Gly Ala He Leu Glu Phe Asp Lys Pro Glu Thr 
1470 1475 1480 
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CTC CTC AGC CAG AAG GAC AGC GTG TTC GCC TCC TTT GTC CGT GCG GAC 4 5X5 

Leu Leu Ser Gin Lys Asp Ser Val Phe Ala Ser Phe Val Arg Ala Asp 
1485 1490 1495 

AAG TGACTTACCG G AGC CAAAGT GCCACCCCGC GCCTCGCTTG CTTGCCTAGG 456 8 

Lys 

ATTTCTAACT GCAAATCACT TGTAAATAAA TTAATTCTTT GCTAAAAAAA AAAAAAA 462 5 

(8) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1498 amino acids 
(E) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Pro Leu Ala Phe Cys Gly Thr Glu Asn His Ser Ala Ala Tyr Arg 
15 10 15 

Val Asp Gin Gly Val Leu Asn Asn Gly Cys Phe Val Asp Ala Leu Asn 
20 25 30 

Val Val Pro His Val Phe Leu Leu Phe lie Thr Phe Pro lie Leu Phe 
35 40 45 

lie Gly Trp Gly Ser Gin Ser Ser Lys Val His He His His Ser Thr 
50 55 60 

Trp Leu His Phe Pro Gly His Asn Leu Arg Trp He Leu Thr Phe He 
65 70 75 80 

Leu Leu Phe Val Leu Val Cys Glu He Ala Glu Gly He Leu Ser Asp 
85 90 95 

Gly Val Thr Glu Ser Arg His Leu His Leu Tyr Met Pro Ala Gly Met 
100 105 110 

Ala Phe Met Ala Ala He Thr Ser Val Val Tyr Tyr His Asn He Glu 
115 120 125 

Thr Ser Asn Phe Pro Lys Leu Leu He Ala Leu Leu He Tyr Trp Thr 
130 135 140 

Leu Ala Phe He Thr Lys Thr lie Lys Phe Val Lys Phe Tyr Asp His 
145 150 155 160 

Ala lie Gly Phe Ser Gin Leu Arg Phe Cys Leu Thr Gly Leu Leu Val 
165 170 175 

He Leu Tyr Gly Met Leu Leu Leu Val Glu Val Asn Val He Arg Val 
180 185 190 

Arg Arg Tyr He Phe Phe Lys Thr Pro Arg Glu Val Lys Pro Pro Glu 
195 200 205 

Asp Leu Gin Asp Leu Gly Val Arg Phe Leu Gin Pro Phe Val Asn Leu 
210 215 220 

Leu Ser Lys Gly Thr Tyr Trp Trp Met Asn Ala Phe He Lys Thr Ala 
225 ' 230 * ~ 235 240 
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His Lys Lys Pro lie Asp Leu Arg Ala lie Ala Lys Leu Pro lie Ala 

245 250 255 

Met Arg Ala Leu Thr Asn Tyr Gin Arg Leu Cys Val Ala Phe Asp Ala 
260 265 270 

Gin Ala Arg Lys Asp Thr Gin Ser Pro Gin Gly Ala Arg Ala lie Trp 
275 280 285 

Arg Ala Leu Cys His Ala Phe Gly Arg Arg Leu He Leu Ser Ser Thr 
290 295 300 

Phe Arg He Leu Ala Asp Leu Leu Gly Phe Ala Gly Pro Leu Cys He 
305 310 315 320 

Phe Gly He Val Asp His Leu Gly Lys Glu Asn His Val Phe Gin Pro 
325 330 335 

Lys Thr Gin Phe Leu Gly Val Tyr Phe Val Ser Ser Gin Glu Phe Leu 
340 345 350 

Gly Asn Ala Tyr Val Leu Ala Val Leu Leu Phe Leu Ala Leu Leu Leu 
355 360 365 

Gin Arg Thr Phe Leu Gin Ala Ser Tyr Tyr Val Ala He Glu Thr Gly 
370 375 380 

He Asn Leu Arg Gly Ala He Gin Thr Lys He Tyr Asn Lys lie Met 
385 390 395 400 

His Met Ser Thr Ser Asn Leu Ser Met Gly Glu Met Thr Ala Gly Gin 
405 410 415 

He Cys Asn Leu Val Ala He Asp Thr Asn Gin Leu Met Trp Phe Phe 
420 425 430 

Phe Leu Cys Pro Asn Leu Trp Thr Met Pro Val Gin He He Val Gly 
435 440 445 

Val He Leu Leu Tyr Tyr He Leu Gly Val Ser Ala Leu lie Gly Ala 
450 455 460 

Ala Val lie He Leu Leu Ala Pro Val Gin Tyr Phe Val Ala Thr Lys 
465 470 475 480 

Leu Ser Gin Ala Gin Arg Thr Thr Leu Glu His Ser Asn Glu Arg Leu 
4B5 490 495 

Lys Gin Thr Asn Glu Met Leu Arg Gly Met Lys Leu Leu Lys Leu Tyr 
500 505 510 

Ala Trp Glu Ser He Phe Cys Ser Arg Val Glu Val Thr Arg Arg Lys 
515 520 525 

Glu Met Thr Ser Leu Arg Ala Phe Ala Val Tyr Thr Ser He Ser lie 
530 535 540 

Phe Met Asn Thr Ala He Pro He Ala Ala Val Leu He Thr Phe Val 
545 550 555 560 

Gly His Val Ser Phe Phe Lvs Glu Ser Asp Leu Ser Pro Ser Val Ala 
565 570 575 

Phe Ala Ser Leu Ser Leu Phe His He Leu Val Thr Pro Leu Phe Leu 
580 585 590 

Leu Ser Ser Val Val Arg Ser Thr Val Lys Ala Leu Val Ser Val Gin 
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595 600 605 

Lys Leu Ser Glu Phe Leu Ser Ser Ala Glu lie Arg Glu Glu Gin Cys 
610 615 620 

Ala Pro Arg Glu Pro Ala Pro Gin Gly Gin Ala Gly Lys Tyr Gin Ala 
- 625 630 635 640 

Val Pro Leu Lys Val Val Asn Arg Lys Arg Pro Ala Arg Glu Glu Val 
645 650 655 

Arg Asp Leu Leu Gly Pro Leu Gin Arg Leu Ala Pro Ser Met Asp Gly 
660 665 670 

Asp Ala Asp Asn Phe Cys Val Gin lie He Gly Gly Phe Phe Thr Trp 
675 680 685 

Thr Pro Asp Gly He Pro Thr Leu Ser Asn He Thr He Arg lie Pro 
690 695 700 

Arg Gly Gin Leu Thr Met He Val Gly Gin Val Gly Cys Gly Lys Ser 
705 710 715 720 

Ser Leu Leu Leu Ala Thr Leu Gly Glu Met Gin Lys Val Ser Gly Ala 
725 730 735 

Val Phe Trp Asn Ser Asn Leu Pro Asp Ser Glu Gly Arg Gly Pro Gin 
740 745 750 

Gin Pro Arg Ala Gly Asp Ser Ser Trp Leu Gly Tyr Gin Glu Gin Arg 
755 760 765 

Pro Arg Gly Tyr Ala Ser Gin Lys Pro Trp Leu Leu Asn Ala Thr Val 
770 775 780 

Glu Glu Asn He Thr Phe Glu Ser Pro Phe Asn Pro Gin Arg Tyr Lys 
785 790 795 800 

Met Val lie Glu Ala Cys Ser Leu Gin Pro Asp lie Asp lie Leu Pro 
805 810 815 

His Gly Asp Gin Thr Gin He Gly Glu Arg Gly lie Asn Leu Ser Gly 
820 825 830 

Gly Gin Arg Pro Asp Gin Cys Gly Pro Glu Pro Ser Thr Ser Arg Pro 
835 840 845 

Met Phe Val Phe Leu Asp Asp Pro Phe Ser Ala Leu Asp Val His Leu 
850 855 860 

Ser Asp His Leu Met Gin Ala Gly He Leu Glu Leu Leu Arg Asp Asp 
865 870 875 880 

Lys Arg Thr Val Val Leu Val Thr His Lys Leu Gin Tyr Leu Pro His 
885 890 895 

Ala Asp Trp lie lie Ala Met Lys Asp Gly Thr lie Gin Arg Glu Gly 
900 905 910 

Thr Leu Lvs Asp Phe Gin Arg Ser Glu Cys Gin Leu Phe Glu His Trp 
915 920 925 

Lys Thr Leu Met Asn Arg Gin Asp Gin Glu Leu Glu Lys Glu Thr Val 
930 935 940 

Met Glu Arg Lys Ala Ser Glu Pro Ser Gin Gly Leu Pro Arg Ala Met 
945 ' 950 955 960 
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Ser Ser Arg Asp Gly Leu Leu Leu Asp Glu Glu Glu Glu Glu Glu Glu 
965 970 975 

Ala Ala Glu ser Glu Glu Asp Asp Asn Leu Ser Ser Val Leu His Gin 

985 990 

Arg Ala Lys He Pro Trp Arg Ala Cys Thr Lys Tyr Leu Ser Ser Ala 
3 1000 1005 

Gly lie Leu Leu Leu Ser Leu Leu Val Phe Ser Gin Leu Leu Lys His 

1015 1020 

Met V.l Leu Val Ala lie Asp Tyr Trp Leu Ala Lys Trp Thr Asp Ser 

1030 1035 1040 

Ala Leu Val Leu Ser Pro Ala Ala Arg Asn Cys Ser Leu Ser Gin Glu 
1045 "SO i 0S5 

Cys Asp Leu Asp Gin Ser Val Tyr Ala Met Val Phe Thr Leu Leu Cys 
1060 1065 1070 y 

Ser Leu Gly^le Val Leu Cys LeuVal Thr Ser Val Thr Val Glu Trp 
Thr Gly Leu Lys Val Ala Lys Arg Leu His Arg Ser Leu Leu Asn Arg 

1095 110Q 

11^ lie Leu Ala Pro Met Arg Phe Phe Glu Thr Thr Pro Leu Gly Ser 
X11 ° 1120 

He Leu Asn Arg Phe Ser Ser Asp Cys Asn Thr He Asp Gin His He 
1125 1130 



1135 



Pro Ser Thr Leu Glu Cys Leu Ser Arg Ser Thr Leu Leu Cys Val Ser 

* U 1145 1150 

Ala Leu Thr Val lie Ser Tyr Val Thr Pro Val Phe Leu Val Ala Leu 
1155 USD 1165 

Leu Pro Leu Ala Val Val Cys Tyr Phe lie Gin Lys Tyr Phe Arg Val 

H" 5 1180 

Ala ser Arg Asp Leu Gin Gin Leu Asp Asp Thr Thr Gin Leu Pro Leu 

1190 H95 1200 

Val ser His Phe Ala Glu Thr Val Glu Gly Leu Thr Thr He Arg Ala 
1205 1210 1215 

Phe Arg Tyr Glu Ala Arg Phe Gin Gin Lys Leu Leu Glu Tyr Thr Asp 
1220 1225 ^ 



1230 



Ser Asn As^ile Ala Ser Leu Ph^Leu Thr Ala Ala Asn Arg Trp Leu 

G1U Ifso 0 ^ G1U ^ 116 GlY Ala CyS Val Val Leu He Ala Ala 

x, " u 1255 126O 

Ala Thr Ser lie Ser Asn Ser Leu His Arg Glu Leu Ser Ala Gly Leu 
65 1270 1275 1280 

Val Gly Leu Gly Leu Thr Tyr Ala Leu Met lie Gly lie Cvs Glv Ara 
1285 1290 ' i 2 - 95 y 

Thr Ala Ser Gly Lys Ser Ser Phe Ser Leu Ala Phe Phe Arg Met Val 
1300 1305 12 * Q 

Asp Met Phe Glu Glv Arg He lie He Asp Gly He Asp He Ala Lys 
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1315 1320 1325 

Leu Pro Leu His Thr Leu Arg Ser Arg Leu Ser He He Leu Gin Asp 
1330 1335 1340 

Pro Val Leu Phe Ser Gly Thr He Arg Phe Asn Leu Asp Pro Glu Lys 
1345 1350 1355 1360 

Lys Cys Ser Asp Ser Thr Leu Trp Glu Ala Leu Glu He Ala Gin Leu 
1365 1370 1375 

Lys Leu Val Val Lys Ala Leu Pro Gly Gly Leu Asp Ala He lie Thr 
1380 1385 1390 

Glu Gly Gly Glu Asn Phe Ser Gin Gly Gin Arg Gin Leu Phe Cys Leu 
1395 1400 1405 

Ala Arg Ala Phe Val Arg Lys Thr Ser lie Phe He Met Asp Glu Ala 
1410 1415 1420 

Thr Ala Ser He Asp Met Ala Thr Glu Asn He Leu Gin Lys Val Val 
1425 1430 1435 1440 

Met Thr Ala Phe Ala Asp Arg Thr Val Val Thr He Ala His Arg Val 
1445 1450 1455 

His Thr He Leu Ser Ala Asp Leu Val Met Val Leu Lys Arg Gly Ala 
1460 1465 1470 

He Leu Glu Phe Asp Lys Pro Glu Thr Leu Leu Ser Gin Lys Asp Ser 
1475 1480 1485 

Val Phe Ala Ser Phe Val Arg Ala Asp Lys 
1490 1495 



(9) INFORMATION FOR SEQ ID NO: 8: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: 'amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Pro Leu Ala Phe Cys Gly Thr 

1 5 

(10) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 
{ B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Asn His Ser Ala Ala Tyr Arg Val Asp Gin Gly 
1 5 10 



(11) INFORMAT ION FOR SEQ ID NO: 10: 
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(i) SEQUENCE CHARACTERISTICS : 

CA) LENGTH: 26 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
GAGAGAAGCT TNTGNGGNGA NAANCA 
(12) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GAGAGAGAAT TCCNTGNTCN ACNCNNTA 

(13) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 4 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO:12: 
TTTTG CGGGA CGG AG AAT C A CTCGGCCGCC TACCGCGTCG ACCAAGG 4? 
(14) INFORMATION FOR SEQ ID NO: 13; 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
'GCCNCCAUG 

9 

(15) INFORMATION FOR SEQ ID NO: 14; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
GXXGXGK 
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(16) INFORMATION FOR SEQ ID NO : 15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

XXXD 



(17) INFORMATION FOR SEQ ID NO; 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IB base pairs 

(B) TYPE; nucieic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acids 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
CACGCTCAGG TTCTGGAT 18 

(18) INFORMATION FOR SEQ ID NO; 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acids 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
TCAACTGGAT GGTGAGGA 18 
(17) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs . 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TGACATCGCC AAACTGC 

17 

(18) INFORMATION FOR SEQ ID NO; 19: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH; 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TCCTGGCAGT GCCTTCA 

17 

(19) INFORMATION FOR SEQ ID NO; 20: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE : YES 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20; 
TCCTCTCAGG GTCCAGGTTA 

20 

(20) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH: 17 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL; NO 
(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ACAAGGAGCC TGGGGAT 

17 

(21) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
( x i i ) HYPOTHETI CAL : NO 
(iv) ANT I- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TGCATGGGTC CCAGTGA 

17 

(22) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 25 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
TTGACCATTC ACCACATTGG TGTGC 
(23) INFORMATION FOR SEQ ID NO:24: 

( i ) S EQUENCE CHARACTER ISTICS : 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
TCCTGGCAGT GCCTTCA 

(24) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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fiii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
ACCATCGACC AG C AC AT C 

(2 5) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 08 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 26: 
GGACCTGCAG CAGCTGGATG ACACCACCCA GCTTCCACTT CTCTCACACT TTGCCGAAAC 
CGTAGAAGGA CTCACCACCA TCCGGGCCTT CAGGTATGAG GCCCGGTTCC AGCAGAAGCT 
TCTCGAATAC ACAGACTCCA AC AAC ATTG C TTCCCTCTTC CTCACAGCTG CCAACAGATG 
GCTGGAAGTC CGAATGGAGT ACATCGGTGC ATGTGTGGTG CTCATCGCAG CGGTGACCTC 
CATCTCCAAC TCCCTGCACA GGGAGCTCTC TGCTGGCCTG GTGGGCCTGG GCCTTACCTA 
CGCCCTAATG GTCTCCAACT ACCTCAACTG GATGGTGAGG AACCTGGCAG ACATGGAGCT 
CCAGCTGGGG GCTGTGAAGC GCATCCATGG GCTCCTGAAA ACCGAGGCAG AGAGCTACGA 
GGGACTCCTG GCACCATCGC TGAT C C C AAA GAACTGGCCA GACCAAGGGA AGATCCAGAT 
CCAGAACCTG AGCGTGCGCT ACGACAGCTC CCTGAAGCCG GTGCTGAAGC ACGTCAATGC 
CCTCATCTCC C CTGG AC AGA AGATCGGGAT CTGCGGCCGC ACCGGCAGTG GGAAGTCCTC 
CTTCTCTCTT GCCTTCTTCC G C ATGG TG G A CACGTTCGAA GGGCACATCA TCATTGATGG 
CATTGACATC GCCAAACTGC CGCTGCACAC CCTGCGCTCA CGCCTCTCCA TCATCCTGCA 
GGACCCCGTC CTCTTCAGCG GCACCATCCG ATTTAACCTG GACCCTGAGA GGAAGTCCTC 
AGATAGCACA CTGTGGGAGG CCCTGGAAAT CGCCCAGCTG AAGCTGGTGG TGAAGGCACT 
GCCAGGAGGC CTCGATGCCA TCATCACAGA AGGCGGGGAG AATTTC AG C C AGGGACAGAG 
GCAGCTGTTC TGCCTGGCCC GGGCCTTCGT GAGGAAGACC AGCATCTTCA TCATGGACGA 
GGCCACGGCT TCCATTGACA TGGCCACGGA A^CATCCTC CAAAAGGTGG TGATGACAGC 
CTTCGCAGAC CGCACTGTGG TCACCATCGC GCATCGAGTG CACACCATCC TG AG TG C AGA 
CCTGGTGATC GTCCTGAAGC GGGGTGCCAT CCTTGAGTTC G ATAAG C C AG AGAAGCTGCT 
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60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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CAGCCGGAAG GACAGCGTCT TCGCCTCCTT CGTCCGTGCA GACAAGTGAC CTGCCAGAGC 12 0 0 

CCAAGTGCCA TCCCACATTC GGACCCTGCC CATACCCCTG CCTGGGTTTT CTAACTGTAA 12 6 0 

ATCACTTGTA AATAAATAGA TTTGATTATT TCCTAAAAAA AAAAAAAA 13 08 
(26) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE : NO 

(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 2. ,1186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

G GAC CTG CAG CAG CTG GAT GAC ACC ACC CAG CTT CCA CTT CTC TCA 46 
Asp Leu Gin Gin Leu Asp Asp Thr Thr Gin Leu Pro Leu Leu Ser 
15 10 15 

CAC TTT GCC GAA ACC GTA GAA GGA CTC ACC ACC ATC CGG GCC TTC AGG 94 
His Phe Ala Glu Thr Val Glu Gly Leu Thr Thr He Arg Ala Phe Arg 
20 25 30 

TAT GAG GCC CGG TTC CAG CAG AAG CTT CTC GAA TAC ACA GAC TCC AAC 142 
Tyr Glu Ala Arg Phe Gin Gin Lys Leu Leu Glu Tyr Thr Asp Ser Asn 
35 40 ■ 45 

AAC ATT GCT TCC CTC TTC CTC ACA GCT GCC AAC AGA TGG CTG GAA GTC 190 
Asn He Ala Ser Leu Phe Leu Thr Ala Ala Asn Arg Trp Leu Glu Val 
50 55 60 

CGA ATG GAG TAC ATC GGT GCA TGT GTG GTG CTC ATC GCA GCG GTG ACC 2 38 

Arg Met Glu Tyr He Gly Ala Cys Val Val Leu He Ala Ala Val Thr 
65 70 75 

TCC ATC TCC AAC TCC CTG CAC AGG GAG CTC TCT GCT GGC CTG GTG GGC 286 
Ser He Ser Asn Ser Leu His Arg Glu Leu Ser Ala Gly Leu Val Gly 
80 85 90 95 

CTG GGC CTT ACC TAC GCC CTA ATG GTC TCC AAC TAC CTC AAC TGG ATG 334 
Leu Gly Leu Thr Tyr Ala Leu Met val Ser Asn Tyr Leu Asn Trp Met 
100 105 110 

GTG AGG AAC CTG GCA GAC ATG GAG CTC CAG CTG GGG GCT GTG AAG CGC 3 82 

Val Arg Asn Leu Ala Asp Met Glu Leu Gin Leu Gly Ala Val Lys Arg 
115 120 125 

ATC CAT GGG CTC CTG AAA ACC GAG GCA GAG AGC TAC GAG GGA CTC CTG 430 
He His Glv Leu Leu Lys Thr Glu Ala Glu Ser Tyr Glu Gly Leu Leu 
130 135 140 

GCA CCA TCG CTG ATC CCA AAG AAC TGG CCA GAC CAA GGG AAG ATC CAG 4 78 

Ala Pro Ser Leu He Pro Lys Asn Trp Pro Asp Gin Gly Lys He Gin 
145 150 155 
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ATC CAG AAC CTG AGC GTG CGC TAC GAC AGC TCC CTG AAG CCG GTG CTG 52 6 

lie Gin Asn Leu Ser Val Arg Tyr Asp Ser Ser Leu Lys Pro Val Leu 
160 165 170 175 

AAG CAC GTC AAT GCC CTC ATC TCC CCT GGA CAG AAG ATC GGG ATC TGC 5 74 

Lys His Val Asn Ala Leu lie Ser Pro Gly Gin Lys lie Gly lie Cys 
180 185 190 

GGC CGC ACC GGC AGT GGG AAG TCC TCC TTC TCT CTT GCC TTC TTC CGC 622 
Gly Arg Thr Gly Ser Gly Lys Ser Ser Phe Ser Leu Ala Phe Phe Arg 
195 200 205 

ATG GTG GAC ACG TTC GAA GGG CAC ATC ATC ATT GAT GGC ATT GAC ATC 67 0 

Met Val Asp Thr Phe Glu Gly His lie He He Asp Gly He Asp He 
210 215 220 

GCC AAA CTG CCG CTG CAC ACC CTG CGC TCA CGC CTC TCC ATC ATC CTG 718 
Ala Lys Leu Pro Leu His Thr Leu Arg Ser Arg Leu Ser He lie Leu 
225 230 235 

CAG GAC CCC GTC CTC TTC AGC GGC ACC ATC CGA TTT AAC CTG GAC CCT 766 
Gin Asp Pro Val Leu Phe Ser Gly Thr He Arg Phe Asn Leu Asp Pro 
240 245 250 255 

GAG AGG AAG TGC TCA GAT AGC AC A CTG TGG GAG GCC CTG GAA ATC GCC 614 
Glu Arg Lys Cys Ser Asp Ser Thr Leu Trp Glu Ala Leu Glu He Ala 
260 265 270 

CAG CTG AAG CTG GTG GTG AAG GCA CTG CCA GGA GGC CTC GAT GCC ATC 86 2 

Gin Leu Lys Leu Val Val Lys Ala Leu Pro Gly Gly Leu Asp Ala He 
275 280 285 

ATC ACA GAA GGC GGG GAG AAT TTC AGC CAG GGA CAG AGG CAG CTG TTC 910 
He Thr Glu Gly Gly Glu Asn Phe Ser Gin Gly Gin Arg Gin Leu Phe 
290 295 300 

TGC CTG GCC CGG GCC TTC GTG AGG AAG ACC AGC ATC TTC ATC ATG GAC 95 8 

Cys Leu Ala Arg Ala Phe Val Arg Lys Thr Ser He Phe lie Met Asp 
305 310 315 

GAG GCC ACG GCT TCC ATT GAC ATG GCC ACG GAA AAC ATC CTC CAA AAG 100 6 

Glu Ala Thr Ala Ser He Asp Met Ala Thr Glu Asn He Leu Gin Lys 
320 325 330 335 

GTG GTG ATG ACA GCC TTC GCA GAC CGC ACT GTG GTC ACC ATC GCG CAT 1054 
Val Val Met Thr Ala Phe Ala Asp Arg Thr Val Val Thr lie Ala His 
340 345 350 

CGA GTG CAC ACC ATC CTG AGT GCA GAC CTG GTG ATC GTC CTG AAG CGG 1102 
Arg Val His Thr He Leu Ser Ala Asp Leu Val He Val Leu Lys Arg 
355 360 365 

GGT GCC ATC CTT GAG TTC GAT AAG CCA GAG AAG CTG CTC AGC CGG AAG 115 0 

Gly Ala He Leu Glu Phe Asp Lys Pro Glu Lys Leu Leu Ser Arg Lys 
370 375 380 

GAC AGC GTC TTC GCC TCC TTC GTC CGT GCA GAC AAG TGACCTGCCA 1196 
Asp Ser Val Phe Ala Ser Phe Val Arg Ala Asp Lys 
385 390 395 

GAGCCCAAGT GCCATCCCAC ATTCGGACCC TGCCCATACC CCTGCCTGGG TTTTCTAACT 12 56 

GTAAATCACT TGTAAATAAA TAGATTTGAT TATTTCCTAA AAAAAAAAAA AA 13 0 8 



(2 7) INFORMATION FOR SEQ ID NO: 28: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 95 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Asp Leu Gin Gin Leu Asp Asp Thr Thr Gin Leu Pro Leu Leu Ser His 
15 10 IS 

Phe Ala Glu Thr Val Glu Gly Leu Thr Thr lie Arg Ala Phe Arg Tyr 
20 25 30 

Glu Ala Arg Phe Gin Gin Lys Leu Leu Glu Tyr Thr Asp Ser Asn Asn 
35 40 45 

He Ala Ser Leu Phe Leu Thr Ala Ala Asn Arg Trp Leu Glu Val Arg 
50 55 60 

Met Glu Tyr lie Gly Ala Cys Val Val Leu He Ala Ala Val Thr Ser 
65 70 75 80 

He Ser Asn Ser Leu His Arg Glu Leu Ser Ala Gly Leu Val Gly Leu 
85 90 95 

Gly Leu Thr Tyr Ala Leu Met Val Ser Asn Tyr Leu Asn Trp Met Val 
100 105 . 110 

Arg Asn Leu Ala Asp Met Glu Leu Gin Leu Gly Ala Val Lys Arg He 
115 120 125 

His Gly Leu Leu Lys Thr Glu Ala Glu Ser Tyr Glu Gly Leu Leu Ala 
130 135 140 

Pro Ser Leu He Pro Lys Asn Trp Pro Asp Gin Gly Lys He Gin He 
145 150 155 160 

Gin Asn Leu Ser Val Arg Tyr Asp Ser Ser Leu Lys Pro Val Leu Lys 
165 170 175 

His Val Asn Ala Leu He Ser Pro Gly Gin Lys He Gly He Cys Gly 
180 185 190 

Arg Thr Gly Ser Gly Lys Ser Ser Phe Ser Leu Ala Phe Phe Arg Met 
195 " 200 205 

Val Asp Thr Phe Glu Gly His lie lie He Asp Gly lie Asp He Ala 
210 215 220 

Lys Leu Pro Leu His Thr Leu Arg Ser Arg Leu Ser lie lie Leu Gin 
225 230 235 240 

Asp Pro Val Leu Phe Ser Gly Thr He Arg Phe Asn Leu Asp Pro Glu 
245 250 255 

Arg Lys Cys Ser Asp Ser Thr Leu Trp Glu Ala Leu Glu lie Ala Gin 
260 265 270 

Leu Lvs Leu Val Val Lys Ala Leu Pro Gly Gly Leu Asp Ala He He 
275 280 285 

Thr Glu Gly Gly Glu Asn Phe Ser Gin Gly Gin Arg Gin Leu Phe Cys 
290 * * 29S ' 300 

Leu Ala Arg Ala Phe Val Arg Lys Thr Ser He Phe He Met Asp Glu 
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305 310 315 320 

Ala Thr Ala Ser lie Asp Met Ala Thr Glu Asn lie Leu Gin Lys Val 
325 330 335 

Val Met Thr Ala Phe Ala Asp Arg Thr Val Val Thr lie Ala His Arg 
340 345 350 

Val His Thr lie Leu Ser Ala Asp Leu Val He Val Leu Lys Arg Gly 
355 360 365 

Ala He Leu Glu Phe Asp Lys Pro Glu Lys Leu Leu Ser Arg Lys Asp 
370 375 380 

Ser Val Phe Ala Ser Phe Val Arg Ala Asp Lys 
385 390 395 



(28) INFORMATION FOR SEQ ID NO : 2 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

CCATGCCTGG TGGCTGAGCC CAGCCCAGCC CCCAGCACCA TCGCTGATCC CAAAGAACTG 6 0 

G C C AG AC C AA GGGAAGATCC AGATCCAGAA CCTGAGCGTG CGCTACGACA GCTCCCTGAA 12 0 

GCCGGTGCTG AAGCACGTCA ATGCCCTCAT CTCCCCTGGA CAGAAGGTCA GTGCACGGGC 18 0 

CCAACCCAAT GCTGC 195 
(29) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CTGCTCTCTG CAGCCAGACA TCGACATCCT GCCCCATGGA GACCAGACCC AGATTGGGGA 6 0 

ACGGGGCATC AACCTGTCTG GTGGTCAACG CCAGCGAATC AGTGTGGCCC GAGCCCTCTA 12 0 

CCAGCACGCC AACGTTGTCT TCTTGGATGA CCCCTTCTCA GCTCTGGATA TCCATCTGAG 18 0 
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TGACCACTTA 


ATGCAGGCCG 


GCATCCTTGA 


GCTGCTCCGG 


GACGACAAGA 


GGACAGTGGT 


240 


CTTAGTGACC 


CACAAGCTAC 


AGTACCTGCC 


CCATGCAGAC 


TGGATCATTG 


CCATGAAGGA 


300 


TGGCACCATC 


CAGAGGGAGG 


GTACCCTCAA 


GGACTTCCAG 


AGGTCTGAAT 


GCCAGCTCTT 


360 


TGAGCACTGG 


AAGACCCTCA 


TGAACCGACA 


GGACCAAGAG 


CTGGAGAAGG 


AGACTGTCAC 


420 


AGAGAGAAAA 


G C C AC AG AG C 


CACCCCAGGG 


CCTATCTCGT 


GCCATGTCCT 


CGAGGGATGG 


480 


CCTTCTGCAG 


GATGAGGAAG 


AGGAGGAAGA 


GGAGGCAGCT 


GAGAGCGAGG 


AGGATGACAA 


540 


CCTGTCGTCC 


ATGCTGCACC 


AGCGTGCTGA 


GATCCCATGG 


CGAGCCTGCG 


CCAAGTACCT 


600 


GTCCTCCGCC 


GGCATCCTGC 


TCCTGTCGTT 


GCTGGTCTTC 


TCACAGCTGC 


TCAAGCACAT 


660 


GGTCCTGGTG 


GCCATCGACT 


ACTGGCTGGC 


CAAGTGG AC C 


GACAGCGCCC 


TGACCCTGAC 


720 


CCCTGCAGCC 


AGGAACTGCT 


CCCTCAGCCA 


GGAGTGCACC 


CTCGACCAGA 


CTGTCTATGC 


780 


CATGGTGTTC 


ACGGTGCTCT 


GCAGCCTGGG 


CATTGTGCTG 


TGCCTCGTCA 


CGTCTGTCAC 


840 


TGTGGAGTGG 


ACAGGGCTGA 


AGGTGGCCAA 


GAGACTGCAC 


CGCAGCCTGC 


TAAACCGGAT 


900 


CATCCTAGCC 


CCCATGAGGT 


TTTTTGAGAC 


CACTCCCCTT 


G GGAG CAT CC 


TGAACAGATT 


960 


TTCATCTGAC 


TGTAACACCA 


TCGACCAGCA 


CATCCCATCC 


ACGCTGGAGT 


GCCTGAGCCG 


1020 


CTCCACCCTG 


CTCTGTGTCT 


CAGCCCTGGC 


CGTCATCTCC 


TATGTCACAC 


CTGTGTTCCT 


1080 


CGTGGCCCTC 


CTTCCCCTGG 


CCATCGTGTG 


CTACTTCATC 


CAGAAGTACT 


TCCGGGTGGC 


1140 


GTCCAGGGAC 


CTGCAGCAGC 


TGGATGACAC 


CACCCAGCTT 


CCACTTCTCT 


CACACTTTGC 


1200 


CGAAACCGTA 


G AAGG ACT C A 


CCACCATCCG 


GGCCTTCAGG 


TATGAGGCCC 


GGTTCCAGCA 


1260 


GAAGCTTCTC 


GAATACACAG 


ACTCCAACAA 


CATTGCTTCC 


CTCTTCCTCA 


CAGCTGCCAA 


1320 


CAGATGGCTG 


GAAGTCCGAA 


TGGAGTACAT 


CGGTGCATGT 


GTGGTGCTCA 


TCGCAGCGGT 


1380 


GACCTCCATC 


TCCAACTCCC 


TGCACAGGGA 


GCTCTCTGCT 


GGCCTGGTGG 


GCCTGGGCCT 


1440 


TACCTACGCC 


CTAATGGTCT 


CCAACTACCT 


CAACTGGATG 


GTGAGGAACC 


TGGCAGACAT 


1500 


GGAGCTCCAG 


CTGGGGGCTG 


TGAAGCGCAT 


CCATGGGCTC 


CTGAAAACCG 


AGGCAGAGAG 


1560 


CTACGAGGGA 


CTCCTGGCAC 


CATCGCTGAT 


CCCAAAGAAC 


TGGCCAGACC 


AAG GG AAG AT 


1620 


C C AG AT C C AG 


AACCTGAGCG 


TGCGCTACGA 


CAGCTCCCTG 


AAG CCGGTGC 


TGAAGCACGT 


1680 


CAATGCCCTC 


ATCTCCCCTG 


GACAGAAGAT 


CGGGATCTGC 


GGCCGCACCG 


GCAGTGGGAA 


1740 


GT.CCTCCTTC 


TCTCTTGCCT 


TCTTCCGCAT 


GGTGGACACG 


TTCGAAGGGC 


ACATCATCAT 


1800 


TGATGGCATT 


GACATCGCCA 


AACTGCCGCT 


GCACACCCTG 


CGCTCACGCC 


T CT C C AT CAT 


1860 


CCTGCAGGAC 


CCCGTCCTCT 


TCAGCGGCAC 


CATCCGATTT 


AACCTGGACC 


CT G AG AG G AA 


1920 


GTGCTCAGAT 


AGCACACTGT 


GGGAGGCCCT 


GGAAATCGCC 


CAGCTGAAGC 


TGGTGGTGAA 


1980 


GGCACTGCCA 


GGAGGCCTCG 


ATGCCATCAT 


CACAGAAGGC 


GGGGAGAATT 


TCAGCCAGGG 


2040 


ACAGAGGCAG 


CTGTTCTGCC 


TGGCCCGGGC 


CTTCGTGAGG 


AAG A C C AG C A 


TCTTCATCAT 


2100 


GGACGAGGCC 


ACGGCTTCCA 


TTGACATGGC 


CACGGAAAAC 


ATCCTCCAAA 


AGGTGGTGAT 


2160 


GACAGCCTTC 


GCAGACCGCA 


CTGTGGTCAC 


CATCGCGCAT 


CGAGTGCACA 


CCATCCTGAG 


2220 
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TGCAGACCTG 


GTGATCGTCC 


TGAAGCGGGG 


TGCCATCCTT 


GAGTTCGATA 


AG C C AG AG AA 


2280 


GCTGCTCAGC 


CGGAAGGACA 


GCGTCTTCGC 


CTCCTTCGTC 


CGTGCAGACA 


AGTGACCTGC 


2340 


CAGAGCCCAA 


GTGCCATCCC 


AC AT T C GG AC 


CCTGCCCATA 


CCCCTGCCTG 


GGTTTTCTAA 


2400 


CTGTAAATCA 


CTTGTAAATA 


AATAGATTTG 


ATTATTTCCT 


AAAAAAAAAA 


AAAA 


2454 



(30) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:31: 

GATGACCCCT TCTCAGCTTT GGATGTCCAT CTGAGTGACC ACCTGATGCA GGCCGGCATC 60 

CTTGAGCTGC TCCGGGATGA CAAGAGGACA GTGGTCTTGG TGACCCACAA GCTACAGTAT 12 0 

CTGCCTCATG C AG AC TG G AT CATTGCCATG AAGGATGGGA CCATTCAGAG GGAAGGGACG 18 0 

CTCAAGGACT TCCAGAGGTC CGAGTGCCAG CTCTTTGAGC ACTGGAAGAC CCTCATGAAC 24 0 

CGGCAGGACC AAGAGCTGGA GAAGGAGACA GTCATGGAGA GGAAAGCCTC AG AG C CAT C T 3 00 

CAGGGCCTGC CCCGTGCCAT GTCCTCCAGA GACGGCCTTC TGCTGGATGA GGAAGAGGAG 36 0 

GAAGAGGAGG CAGCCGAAAG CGAGGAAGAT GACAACTTAT CTTCAGTGCT G CAT C AG C G A 42 0 

GCTAAGATCC CCTGGCGAGC CTGCACTAAG TATCTGTCCT CTGCTGGCAT TCTGCTCCTG 4 80 

TCCCTGCTTG TCTTCTCCCA GCTGCTCAAG CACATGGTCT TGGTGGCCAT TGATTATTGG 54 0 

CTGGCCAAGT GGACGGACAG TGCCCTGGTC CTGAGCCCCG CTGCCAGGAA CTGTTCGCTC 6 00 

AGCCAGGAAT GTGACCTGGA CCAGTCTGTC TATGCCATGG TATTCACCTT GCTCTGCAGC 6 60 

CTGGGTATCG TGCTGTGCCT GGTCACCTCT GTCACTGTGG AGTGGACGGG ACTGAAGGTG 72 0 

GCCAAG AG GC TACACCGCAG CCTGCTCAAC CGCATCATCC TGGCCCCCAT GAGGTTCTTT 78 0 

. GAGACCACAC CCCTCGGGAG TATCCTGAAC AGATTTTCAT CCGACTGTAA CACCATTGAC 84 0 

CAGCACATCC CATCCACGCT GGAGTGTCTG AGCCGGTCCA CCCTGCTGTG TGTCTCCGCC 90 0 

CTGACTGTCA TCTCCTATGT CACACCCGTG TTCCTCGTGG CCCTCTTACC CCTAGCTGTT 96 0 

GTGTGCTACT TCATTCAGAA GTACTTCCGA GTGGCATCCA GGGACCTGCA GCAGCTGGAC 102 0 

GACACGACGC AGCTCCCGCT CGTCTCACAC TTTGCTGAAA CTGTGGAGGG ACTCACCACC 10 8 0 

ATCCGTGCCT TCAGGTACGA GGCCCGGTTC CAGCAGAAGC TTCTAGAATA TACCGACTCC 114 0 

AACAA CAT CG CCTCCCTCTT CCTCACGGCA GCCAACAGA? GGCTGGAAGT CTGCATGGAG 12 0 0 

TACATCGGAG CGTGCGTGGT ACTCATTGCG GCTGCCACCT CCATCTCCAA CTCCCTGCAC -1260 
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AGGGAACTTT 


CTGCTGGCCT 


GGTGGGCCTG 


GGCCTCACCT 


ATGCCTTGAT 


GGTCTCCAAC 


1320 


TACCTCAACT 


GGATGGTGAG 


GAACCTGGCG 


GACATGGAGA 


TCCAGCTGGG 


GGCTGTGAAG 


1380 


AGGATCCACG 


CACTCCTGAA 


AACCGAGGCG 


GAGAGCTATG 


AGGGGCTCCT 


GGCGCCGTCG 


1440 


- TTGATCCCCA 


AGAACTGGCC 


AGACCAAGGG 


AAGATCCAAA 


TT C AG AAC CT 


GAGCGTGCGC 


1500 


TATGACAGCT 


CCCTGAAGCC 


AGTGCTGAAG 


CATGTCAACA 


CCCTCATCTC 


CCCGGGGCAG 


1560 


AAGATCGGGA 


TCTGCGGCCG 


CACAGGCAGC 


GGGAAGTCCT 


CCTTCTCCCT 


GGCCTTTTTC 


1620 


CGAATGGTGG 


ACATGTTTGA 


AGGACGCATC 


ATCATTGATG 


GCATCGACAT 


CGCCAAGCTG 


1680 


CCACTTCACA 


CGCTGCGCTC 


ACGCCTGTCC 


ATCATCCTAC 


AGGACCCCGT 


CCTCTTCAGC 


1740 


GGCACGATCA 


GATTCAACCT 


GGACCCCGAG 


AAGAAATGCT 


C AG AC AG C AC 


ACTGTGGGAG 


1800 


GCCCTGGAGA 


TCGCCCAGCT 


GAAGCTGGTA 


GTGAAGGCAC 


TGCCAGGAGG 


CCTAGATGCC 


1860 


ATCATCACAG 


AAGGAGGGGA 


GAATTTTAGC 


CAGGGCCAGA 


GGCAGCTGTT 


CTGCCTGGCC 


1920 


CGGGCCTTCG 


TGAGGAAGAC 


C AG CAT C TT C 


ATCATGGATG 


AAGCAACCGC 


CTCCATCGAC 


1980 


ATGGCTACGG 


AG AAC AT C CT 


CCAGAAGGTG 


GTGATGACAG 


CCTTCGCAGA 


CCGCACGGTG 


2040 


GTCACCATCG 


CGCATCGTGT 


GCACACCATC 


CTGAGTGCAG 


ACCTGGTGAT 


GGTCCTCAAG 


2100 


AGGGGTGCTA 


TCCTGGAGTT 


TGACAAGCCA 


GAGACGCTCC 


TCAGCCAGAA 


GGACAGCGTG 


2160 


TTCGCCTCCT 


TTGTCCGTGC 


GGACAAGTGA 


CTTACCGGAG 


CCAAAGTGCC 


ACCCCGCGCC 


2220 


TCGCTTGCTT 


GCCTAGGATT 


TCTAACTGCA 


AATCACTTGT 


AAATAAATTA 


ATTCTTTGCT 


2280 


AAAAAAAAAA 


AAAA 










2294 



(31) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5110 base pairs 

(B) TYPE : nucleic acid 

(C) STRANOEDNESS : sxngle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



CCCTTGTGAC 


AGGTCAGTCT 


TACGAGAATA 


TGGTAACTGA 


GATCATGTCA 


ATGGGCTATG 


60 


AACGAGAACA 


AGTAATTGCA 


GCCCTGAGAG 


CCAGCTTCAA 


CAACCCTGAT 


AGAGCTGTGG 


120 


AATATCTTCT 


AATGGGAATC 


CCTGGAGACT 


GAGGAGTTCC 


AGTACTCACA 


GCCTGTGGAG 


180 


GAGGATCAAC 


CACGGCCTGA 


CTTTCGCGGC 


CGCCGCGGGA 


GGCGCGCGGA 


GCCGGAGCCG 


240 


AGCCCGTGCG 


CGCGCCACCA 


TGCCTTTGGC 


CTTCTGCGGC 


ACCGAGAACC 


ACTCGGCCGC 


300 


CTACCGGGTG 


GACCAAGGCG 


TCCTCAACAA 


CGGCTGCTTC 


GTGGACGCGC 


TCAATGTGGT 


360 


GCCACATGTC 


TTTCTGCTCT 


TCATCACCTT 




TTCATCGGAT 


GGGGCAGCCA 


420 
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GAGCTCCAAG 


GTGCACATTC 


ACCACAGCAC 


CTGGCTCCAT 


TTCCCGGGGC 


ACAACCTGCG 


480 


CTGGATCCTG 


ACCTTCATAC 


TGCTCTTCGT 


CCTCGTGTGT 


GAGATCGCTG 


AGGGTATCCT 


540 


GTCTGACGGG 


GTGACAGAAT 


CCCGCCACCT 


CCACTTATAC 


ATGCCAGCTG 


GGATGGCATT 


600 


CATGGCTGCC 


ATCACCTCTG 


TGGTCTACTA 


C CAT AACATT 


GAGACCTCTA 


ACTTTCCCAA 


660 


GCTGCTGATT 


GCTCTGCTCA 


TCTACTGGAC 


CCTGGCCTTC 


AT C AC G AAG A 


CCATCAAGTT 


720 


CGTCAAGTTC 


TACGACCACG 


CCATTGGCTT 


CTCTCAGCTG 


CGCTTCTGCC 


TCACGGGGCT 


780 


TCTGGTGATC 


CTCTACGGGA 


TGCTGCTGCT 


TGTGGAGGTC 


AATGTGATCC 


GGGTGAGGAG 


840 


ATACGTCTTC 


TTCAAGACAC 


CAAGGGAAGT 


AAAGCCCCCC 


GAGGACCTAC 


AGGACCTGGG 


900 


TGTGCGCTTT 


CTGCAGCCCT 


TCGTTAACCT 


GCTATCAAAG 


GGGACCTACT 


GGTGGATGAA 


960 


TGCCTTCATC 


AAGACTGCTC 


AC AAG AAG C C 


CATCGACCTG 


CGGGCCATCG 


GGAAGCTGCC 


1020 


CATTGCCATG 


AGAGCCCTCA 


CCAACTACCA 


GCGACTCTGC 


TTGGCCTTCG 


ATGCCCAGGC 


1080 


GCGGAAGGAC 


ACAC AG AG C C 


AGCAGGGTGC 


CCGGGCCATC 


TGGAGGGCTC 


TCTGTCATGC 


1140 


CTTTGGGAGA 


CGGCTGGTCC 


TCAGCAGCAC 


ATTCCGTATC 


CTGGCCGACC 


TCCTGGGCTT 


1200 


TGCTGGGCCA 


CTCTGCATCT 


TCGGGATCGT 


GGACCACCTC 


GGGAAGGAGA 


ACCACGTCTT 


1260 


CCAGCCCAAG 


ACACAGTTTC 


TTGGAGTTTA 


CTTTGTCTCA 


TCCCAAGAGT 


TCCTCGGCAA 


1320 


TGCCTATGTC 


TTGGCTGTTC 


TTCTGTTCCT 


TGCCCTCCTG 


CTGCAAAGGA 


CCTTTCTACA 


1380 


AGCCTCGTAC 


TACGTTGCCA 


TTGAAACTGG 


GATCAACCTG 


AG AG GAG C AA 


TCCAGACCAA 


144 0 


GATTTACAAT 


AAG AT CAT G C 


ACTTGTCTAC 


TTCCAACCTG 


TCCATGGGGG 


AAATGACTGC 


1500 


TGGGCAGATC 


TGCAACCTGG 


TGGCCATCGA 


CACCAACCAG 


CTCATGTGGT 


TTTTCTTCTT 


1560 


ATGCCCAAAC 


CTCTGGGCTA 


TGCCGGTACA 


GATCATTGTG 


GGCGTGATCC 


TCCTCTACTA 


1620 


CATCCTTGGG 


GTCAGCGCCT 


TGATTGGAGC 


GGCTGTCATC 


ATTCTGCTGG 


CTCCTGTACA 


1680 


GTACTTTGTG 


GCCACCAAGC 


TGTCCCAGGC 


ACAGCGGACG 


ACCCTGGAAT 


ATTCCAATGA 


1740 


GAGGCTGAAG 


CAGACCAATG 


AGATGCTCCG 


GGGCATCAAG 


TTGCTCAAGC 


TCTATGCGTG 


1800 


GGAGAACATC 


TTCTGCTCCA 


GGGTGGAGAA 


GACACGCAGG 


AAGGAAATGA 


CCAGCCTCAG 


1860 


GGCCTTCGCT 


GTCTACACCT 


CCATCTCCAT 


CTTCATGAAC 


ACAGCTATCC 


CCATCGCTGC 


1920 


TGTCCTCATC 


ACCTTCGTGG 


GCCACGTCAG 


CTTCTTCAAA 


GAGTCGGACT 


TCTCGCCCTC 


1980 


GGTGGCCTTT 


GCCTCTCTCT 


CTCTCTTCCA 


CATCCTGGTC 


ACACCGCTGT 


TCCTGCTGTC 


2040 


TAGTGTGGTT 


CGGTCCACTG 


TCAAGGCCCT 


GGTGAGCGTG 


CAAAAGCTGA 


GTGAGTTCCT 


2100 


GTCCAGTGCA 


GAG AT C CG TG 


AGGAACAGTG 


TGCCCCCCGA 


GAGCCCGCAC 


CCCAAGGCCA 


2160 


AGCGGGCAAG 


TACCAGGCGG 


TGCCCCTCAA 


GGTCGTAAAC 


CGCAAGCGCC 


CAGCCCGAGA 


2220 


AGAAGTCCGG 


GACCTCTTGG 


GCCCACTGCA 


GAGGCTGACT 


CCCAGCACGG 


ATGGAGACGC 


2280 


TGACAACTTC 


TGTGTCCAGA 


TCATCGGAGG 


CTTCTTCACC 


TGGACCCCTG 


ATGGAATCCC 


2340 


CACCCTGTCC 


AACATCACCA 


TCCGTATCCC 


CCGAGGTCAG 


CTGACCATGA 


TCGTGGGGCA 


2400 


GGTGGGCTGT 


GGCAAGTCCT 


CGCTCCTTCT 


GGCCACCCTG 


GGGGAGATGC 


AGAAGGTCTC 


2460 
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TGGAGCTGTC 


TTCTGGAACA 


GCCTTCCAGA 


CAGCGAGGGG 


AGAAGACCCC 


AGCAACCCAG 


2520 


AGCGGGAGAC 


AGCGGCCGAT 


TCGGATGCCA 


GGAGCAGAGG 


CCCTGTGGCT 


ACGCATCTCA 


2580 


GAAACCATGG 


CTGCTAAATG 


CCACTGTGGA 


GGAGAACATC 


ACCTTCGAGA 


GTCCCTTCAA 


2640 


TAAGCAACGG 


TACAAGATGG 


TCATCGAAGC 


CTGCTCCCTG 


CAGCCAGACA 


TAGACATCCT 


2700 


GCCCCATGGA 


GACCAGACTC 


AGATTGGGGA 


ACGAGGCATC 


AACTTGAGTA 


CTGGTGGTCA 


2760 


GCGTCCAGAT 


C AG T G TAG AC 


CCGAGCCCTC 


TACCAGCACA 


CCAATGATTG 


TCTTTTTGGA 


2820 


TGACCCTTTC 


TCGGCTCTGG 


ATGTCCATCT 


GAGTGACCAC 


CTAATGCAGG 


CTGGCATCCT 


2880 


CGAGCTGCTC 


CGGGATGACA 


AG AG G AC AG T 


GGTCTTGGTG 


ACCCACAAGC 


TACAGTACCT 


2940 


GCCTCATGCT 


GACTGGATCA 


TTGCTATGAA 


GGATGGCACC 


ATTCAGAGGG 


AG GGG ACACT 


3000 


CAAGGACTTC 


CAGAGGTCTG 


AG TG C C AG CT 


CTTTGAGCAT 


TGGAAGACCC 


TCATGAACCG 


3060 


GCAGGACCAA 


GAGCTGGAGA 


AGGAGACAGT 


CATGGAGAGA 


AAAGCCCCAG 


AGCCATCTCA 


3120 


GGGCCrGCCC 


CGTGCCATGT 


CCTCAAGAGA 


TGGCCTTCTG 


CTGGATGAGG 


ATGAGGAGGA 


3180 


AGAGGAGGCA 


GCCGAGAGCG 


AGGAAGATGA 


CAACTTATCC 


TCTGTGCTGC 


ATCAGCGAGC 


3240 


CAAGATCCCA 


TGGCGAGCCT 


GCACCAAGTA 


TTTGTCCTCT 


GCTGGCATCC 


TGCTCCTGTC 


3300 


CCTGCTTGTC 


TTCTCCCAGC 


TGCTCAAGCA 


CATGGTCTTG 


GTGGCCATTG 


ACTACTGGCT 


3360 


GGCCAAGTGG 


ACGGACAGTG 


CCCTGGTCCT 


GAGCCCCGCC 


GCCAGGAACT 


GCTCCCTCAG 


3420 


CCAGGAATGT 


GCCCTGGACC 


AATCTGTCTA 


TGCCATGGTA 


TTCACCGTGC 


TCTGCAGCCT 


3480 


GGGTATCGCG 


CTGTGCCTTG 


TCACCTCTGT 


CACTGTGGAG 


TGGACGGGAC 


TGAAGGTGGC 


3540 


CAAGAGGCTG 


CATCGCAGCC 


TGCTCAACCG 


TATCATCCTG 


GCTCCCATGA 


GGTTCTTTGA 


3600 


GACCACGCCC 


CTGGGGAGTA 


TCCTGAACAG 


ATTTTCATCT 


GACTGTAACA 


CCATTGACCA 


3660 


GCATATCCCG 


TCCACGCTGG 


AGTGCCTGAG 


CAGATCCACC 


TTACTCTGTG 


TCTCCGCCCT 


3720 


GGCTGTCATC 


TCCTACGTCA 


CGCCTGTGTT 


CCTAGTGGCC 


CTCTTACCCC 


TCGCCGTCGT 


3780 


GTGCTACTTC 


ATCCAGAAGT 


ACTTCCGAGT 


GGCGTCCAGG 


GACCTGCAGC 


AGCTGGACGA 


3840 


CACAACACAG 


CTCCCTCTGC 


TCTCACACTT 


TGCTGAAACT 


GTGGAAGGAC 


TCACCACCAT 


3900 


CCGTGCCTTC 


AGGTACGAGG 


CCCGGTTCCA 


GCAGAAGCTC 


CTAGAGTACA 


CCGACTCCAA 


3960 


CAACATTGCC 


TCTCTCTTCC 


TCACAGCAGC 


CAACAGGTGG 


CTGGAAGTCC 


GCATGGAGTA 


4020 


CATCGGAGCA 


TGCGTGGTAC 


TCATCGCCGC 


TGCCACCTCC 


ATCTCCAACT 


CCCTACACAG 


4080 


GGAGCTCTCA 


GCCGGCCTAG 


TAGGCCTGGG 


CCTCACCTAT 


GCCTTGATGG 


TCTCCAACTA 


4140 


CCTCAACTGG 


ATGGTGAGGA 


ACCTGGCAGA 


CATGGAGATC 


CAACTGGGAG 


CTGTGAAGGG 


4200 


TATCCACACA 


CTCCTGAAAA 


CTGAGGCAGA 


G AG C TAT GAG 


GGGCTCCTGG 


CACCATCGCT 


4260 


GATCCCCAAG 


AACTGGCCAG 


ACCAAGGGAA 


GATCCAAATT 


CAAAAC CTGA 


GTGTACGCTA 


4320 


TGACAGCTCC 


CTGAAGCCCG 


TGCTGAAGCA 


CGTCAACGCC 


CTCATCTCCC 


CAGGACAGAA 


4380 


GATTGGGATC 


TGCGGCCGCA 


CAGGCAGTGG 


AAAATCCTCC 


TTCTCTCTCG 


CCTTTTTCCG 


4440 


AATGGTGGAT 


ATGTTTGAAG 


GGCGTATCAT 


CATCGATGGC 


ATTGACATCG 


CCAAGCTGCC 


4500 
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GCTGCACACG 


CTCGGCTCAC 


GCCTGTCTAT 


CATC CT AC AG 


GACCCTGTTC 


TCTTCAGTGG 


4560 


T AC CATC AG A 


TTCAACCTGG 


ACC CAGAGAA 


GAAATGCTCA 


G AC AG C AC G C 


TGTGGGAGGC 


4620 


TCTGGAGATC 


GCTCAGCTGA 


AGCTGGTGGT 


GAAGGCCCTG 


CCAGGAGGCC 


TG GAT G C CAT 


4680 


CATCACGGAA 


GGAGGGGAGA 


ATTTTAGCCA 


GGGCCAGAGG 


CAGCTGTTCT 


GCCTGGCCCG 


4740 


GGCCTTTGTG 


AGGAAGACCA 


GCATCTTCAT 


C AT GG AT G AA 


GCAACTGCCT 


C CATCGAC AT 


4800 


GGCTACGGAA 


AATATCCTCC 


AGAAGGTGGT 


GATGACAGCC 


TTCGCAGACC 


GCACCGTGGT 


4860 


CACCATCGCG 


CACCGCGTGC 


ACACCATCCT 


GAGTGCAGAC 


CTAGTGATGG 


TCCTGAAGAG 


4920 


GGGCGCGATC 


CTGGAGTTCG 


ACAAGCCGGA 


AAAGCTTCTC 


AGCCAGAAGG 


ACAGCGTCTT 


4980 


TGCCTCCTTT 


GTCCGCGCGG 


AC AAATGAC C 


AGCCAGCGCC 


AAAGTG CC AC 


CCCACACCTC 


5040 


ACCTGCTTGC 


CAT G G ATTT C 


TTACTGTAAA 


TCACTTGTAA 


ATAAAGAAAC 


TAATTCTTTG 


5100 


CTAAAAAAAA 












5110 



(32) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE : DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE; NO 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 260.. 5004 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

CCCTTGTGAC AGGTCAGTCT TACGAGAATA TGGTAACTGA GATCATGTCA ATGGGCTATG 6 0 

AACGAGAACA AGTAATTGCA GCCCTGAGAG CCAGCTTCAA CAACCCTGAT AG AG CT GT GG 12 0 

AATATCTTCT AATGCGAATC CCTGGAGACT GAGGAGTTCC AGTACTCACA GCCTGTGGAG ISO 

GAGGATCAAC CACGGCCTGA CTTTCGCGGC CGCCGCGGGA GGCGCGCGGA GCCGGAGCCG 24 0 

AGCCCGTGCG CGCGCCACC ATG CCT TTG GCC TTC TGC GGC ACC GAG AAC CAC 2 92 

Met Pro Leu Ala Phe Cys Gly Thr Glu Asn His 
15 10 

TCG GCC GCC TAC CGG GTG GAC CAA GGC GTC CTC AAC AAC GGC TGC TTC 34 0 

Ser Ala Ala Tyr Arg Val Asp Gin Gly Val Leu Asn Asn Gly Cys Phe 
15 20 25 



GTG GAC GCG CTC AAT GTG GTG CCA CAT GTC TTT CTG CTC TTC ATC ACC 38 8 

Val Asp Ala Leu Asn Val Val Pro His Val Phe Leu Leu Phe lie Thr 
30 35 40 

TTC CCC ATC CTC TTC ATC GGA TGG GGC AGC CAG AGC TCC AAG GTG CAC 436 
Phe Pro He Leu Phe He Gly Trp Gly Ser Gin Ser Ser Lys Val His 
45 50 55 
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ATT CAC CAC AGC ACC TGG CTC CAT TTC CCG GGG CAC AAC CTG CGC TGG 4 84 

lie His His ser Thr Trp Leu His Phe Pro Gly His Asn Leu Arg Trp 
60 65 70 75 

ATC CTG ACC TTC ATA CTG CTC TTC GTC CTC GTG TGT GAG ATC GCT GAG 53 2 

lie Leu Thr Phe He Leu Leu Phe Val Leu Val Cys Glu He Ala Glu 

80 85 90 

GGT ATC CTG TCT GAC GGG GTG ACA GAA TCC CGC CAC CTC CAC TTA TAC 58 0 

Gly He Leu Ser Asp Gly Val Thr Glu Ser Arg His Leu His Leu Tyr 
95 100 105 

ATG CCA GCT GGG ATG GCA TTC ATG GCT GCC ATC ACC TCT GTG GTC TAC 628 

Met Pro Ala Gly Met Ala Phe Met Ala Ala He Thr Ser Val Val Tyr 
110 115 120 

TAC CAT AAC ATT GAG ACC TCT AAC TTT CCC AAG CTG CTG ATT GCT CTG 676 

Tyr His Asn He Glu Thr Ser Asn Phe Pro Lys Leu Leu He Ala Leu 

125 130 135 

CTC ATC TAC TGG ACC CTG GCC TTC ATC ACG AAG ACC ATC AAG TTC GTC 724 

Leu He Tyr Trp Thr Leu Ala Phe He Thr Lys Thr He Lys Phe Val 
140 145 150 155 

AAG TTC TAC GAC CAC GCC ATT GGC TTC TCT CAG CTG CGC TTC TGC CTC 772 

Lys Phe Tyr Asp His Ala He Gly Phe Ser Gin Leu Arg Phe Cys Leu 

160 165 170 

ACG GGG CTT CTG GTG ATC CTC TAC GGG ATG CTG CTG CTT GTG GAG GTC 82 0 

Thr Gly Leu Leu Val He Leu Tyr Gly Met Leu Leu Leu Val Glu Val 
175 180 185 

AAT GTC ATC CGG GTG AGG AGA TAC GTC TTC TTC AAG ACA CCA AGG GAA 866 

Asn Val He Arg Val Arg Arg Tyr Val Phe Phe Lys Thr Pro Arg Glu 
190 195 200 

GTA AAG CCC CCC GAG GAC CTA CAG GAC CTG GGT GTG CGC TTT CTG CAG 916 

Val Lys Pro Pro Glu Asp Leu Gin Asp Leu Gly Val Arg Phe Leu Gin 

205 210 215 

CCC TTC GTT AAC CTG CTA TCA AAG GGG ACC TAC TGG TGG ATG AAT GCC 96 4 

Pro Phe Val Asn Leu Leu Ser Lys Gly Thr Tyr Trp Trp Met Asn Ala 
220 225 230 235 

TTC ATC AAG ACT GCT CAC AAG AAG CCC ATC GAC CTG CGG GCC ATC GGG 1012 

Phe He Lys Thr Ala His Lys Lys Pro He Asp Leu Arg Ala He Gly 

240 245 250 

AAG CTG CCC ATT GCC ATG AGA GCC CTC ACC AAC TAC CAG CGA CTC TGC 106 0 

Lys Leu Pro He Ala Met Arg Ala Leu Thr Asn Tyr Gin Arg Leu Cys 
255 260 265 

TTG GCC TTC GAT GCC CAG GCG CGG AAG GAC ACA CAG AGC CAG CAG GGT 110 8 

Leii Ala Phe Asp Ala Gin Ala Arg Lys Asp Thr Gin Ser Gin Gin Gly 
270 275 280 

GCC CGG GCC ATC TGG AGG GCT CTC TGT CAT GCC TTT GGG AGA CGG CTG 1156 

Ala Arg Ala He Trp Arg Ala Leu Cys His Ala Phe Gly Arg Arg Leu 

285 290 295 

GTC CTC AGC AGC ACA TTC CGT ATC CTG GCC GAC CTC CTG GGC TTT GCT 12 04 

Val Leu Ser Ser Thr Phe Arg He Leu Ala Asp Leu Leu Gly Phe Ala 
300 305 310 315 

GGG CCA CTC TGC ATC TTC GGG ATC GTG GAC CAC CTC GGG AAG GAG AAC 12 5 2 

Gly Pro Leu Cys He Phe Gly He Val Asp His Leu Gly Lys Glu Asn 

320 * 325 330 
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CAC GTC TTC CAG CCC AAG ACA CAG TTT CTT GGA GTT TAC TTT GTC TCA 13 00 

His Val Phe Gin Pro Lys Thr Gin Phe Leu Gly Val Tyr Phe Val Ser 
335 340 345 

TCC CAA GAG TTC CTC GGC AAT GCC TAT GTC TTG GCT GTT CTT CTG TTC 1348 
Ser Gin Glu Phe Leu Gly Asn Ala Tyr Val Leu Ala Val Leu Leu Phe 
350 355 360 

CTT GCC CTC CTG CTG CAA AGG ACC TTT CTA CAA GCC TCG TAC TAC GTT 13 96 

Leu Ala Leu Leu Leu Gin Arg Thr Phe Leu Gin Ala Ser Tyr Tyr Val 
365 370 375 

GCC ATT GAA ACT GGG ATC AAC CTG AGA GGA GCA ATC CAG ACC AAG ATT 1444 
Ala lie Glu Thr Gly lie Asn Leu Arg Gly Ala He Gin Thr Lys He 
380 385 390 395 

TAC AAT AAG ATC ATG CAC TTG TCT ACT TCC AAC CTG TCC ATG GGG GAA 14 92 

Tyr Asn Lys He Met His Leu Ser Thr Ser Asn Leu Ser Met Gly Glu 
400 405 410 

ATG ACT GCT GGG CAG ATC TGC AAC CTG GTG GCC ATC GAC ACC AAC CAG 154 0 

Met Thr Ala Gly Gin He Cys Asn Leu Val Ala He Asp Thr Asn Gin 
415 420 425 

CTC ATG TGG TTT TTC TTC TTA TGC CCA AAC CTC TGG GCT ATG CCG GTA 1588 
Leu Met Trp Phe Phe Phe Leu Cys Pro Asn Leu Trp Ala Met Pro Val 
430 435 440 

CAG ATC ATT GTG GGC GTG ATC CTC CTC TAC TAC ATC CTT GGG GTC AGC 163 6 

Gin He He Val Gly Val He Leu Leu Tyr Tyr He Leu Gly Val Ser 
445 450 455 

GCC TTG ATT GGA GCG GCT GTC ATC ATT CTG CTG GCT CCT GTA CAG TAC 16 84 

Ala Leu He Gly Ala Ala Val He He Leu Leu Ala Pro Val Gin Tyr 
460 465 470 475 

TTT GTG GCC ACC AAG CTG TCC CAG GCA CAG CGG ACG ACC CTG GAA TAT 173 2 

Phe Val Ala Thr Lys Leu Ser Gin Ala Gin Arg Thr Thr Leu Glu Tyr 
480 485 490 

TCC AAT GAG AGG CTG AAG CAG ACC AAT GAG ATG CTC CGG GGC ATC AAG 178 0 

Ser Asn Glu Arg Leu Lys Gin Thr Asn Glu Met Leu Arg Gly He Lys 
495 500 505 

TTG CTC AAG CTC TAT GCG TGG GAG AAC ATC TTC TGC TCC AGG GTG GAG 182 8 

Leu Leu Lys Leu Tyr Ala Trp Glu Asn He Phe Cys Ser Arg Val Glu 
510 * 515 520 

AAG ACA CGC AGG AAG GAA ATG ACC AGC CTC AGG GCC TTC GCT GTC TAC 18 7 6 

Lys Thr Arg Arg Lys Glu Met Thr Ser Leu Arg Ala Phe Ala Val Tyr 
525 530 535 

ACC TCC ATC TCC ATC TTC ATG AAC ACA GCT ATC CCC ATC GCT GCT GTC 1924 
Thr Ser He Ser He Phe Met Asn Thr Ala He Pro He Ala Ala Val 
540 545 550 555 

CTC ATC ACC TTC GTG GGC CAC GTC AGC TTC TTC AAA GAG TCG GAC TTC 19 72 

Leu lie Thr Phe Val Gly His Val Ser Phe Phe Lys Glu Ser Asp Phe 
560 565 570 

TCG CCC TCG GTG GCC TTT GCC TCT CTC TCT CTC TTC CAC ATC CTG GTC 2 02 0 

Ser Pro Ser Val Ala Phe Ala Ser Leu Ser Leu Phe His He Leu Val 
575 580 585 

ACA CCG CTG TTC CTG CTG TCT AGT GTG GTT CGG TCC ACT GTC AAG GCC 2 06 8 

Thr Pro Leu Phe Leu Leu Ser Ser Val Val Arg Ser Thr Val Lys Ala 
590 595 600 
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CTG GTG AGC GTG CAA AAG CTG AGT GAG TTC CTG TCC AGT GCA GAG ATC 2116 
Leu Val Ser Val Gin Lys Leu Ser Glu Phe Leu Ser Ser Ala Glu lie 
605 610 615 

CGT GAG GAA CAG TGT GCC CCC CGA GAG CCC GCA CCC CAA GGC CAA GCG 2164 
Arg Glu Glu Gin Cys Ala pro Arg Glu Pro Ala Pro Gin Gly Gin Ala 
620 625 630 635 

GGC AAG TAC CAG GCG GTG CCC CTC AAG GTC GTA AAC CGC AAG CGC CCA 2 212 

Gly Lys Tyr Gin Ala Val Pro Leu Lys Val Val Asn Arg Lys Arg Pro 
640 645 650 

GCC CGA GAA GAA GTC CGG GAC CTC TTG GGC CCA CTG CAG AGG CTG ACT 2260 
Ala Arg Glu Glu Val Arg Asp Leu Leu Gly Pro Leu Gin Arg Leu Thr 
655 660 665 

CCC AGC ACG GAT GGA GAC GCT GAC AAC TTC TGT GTC CAG ATC ATC GGA 23 08 

Pro Ser Thr Asp Gly Asp Ala Asp Asn Phe Cys Val Gin lie lie Gly 
670 675 680 

GGC TTC TTC ACC TGG ACC CCT GAT GGA ATC CCC ACC CTG TCC AAC ATC 2 3 56 

Gly Phe Phe Thr Trp Thr pro Asp Gly He Pro Thr Leu Ser Asn He 
685 690 695 

ACC ATC CGT ATC CCC CGA GGT CAG CTG ACC ATG ATC GTG GGG CAG GTG 2 4 04 

Thr He Arg He Pro Arg Gly Gin Leu Thr Met lie Val Gly Gin Val 
700 705 710 715 

GGC TGT GGC AAG TCC TCG CTC CTT CTG GCC ACC CTG GGG GAG ATG CAG 24 52 

Gly Cys Gly Lys Ser Ser Leu Leu Leu Ala Thr Leu Gly Glu Met Gin 
720 725 730 

AAG GTC TCT GGA GCT GTC TTC TGG AAC AGC CTT CCA GAC AGC GAG GGG 2 5 00 

Lys Val Ser Gly Ala Val Phe Trp Asn Ser Leu Pro Asp Ser Glu Gly 
735 740 745 

AGA AGA CCC CAG CAA CCC AGA GCG GGA GAC AGC GGC CGA TTC GGA TGC 2 54 8 

Arg Arg Pro Gin Gin Pro Arg Ala Gly Asp Ser Gly Arg Phe Gly Cys 
750 755 760 

CAG GAG CAG AGG CCC TGT GGC TAC GCA TCT CAG AAA CCA TGG CTG CTA 2 5 96 

Gin Glu Gin Arg Pro Cys Gly Tyr Ala Ser Gin Lys Pro Trp Leu Leu 
765 770 775 

AAT GCC ACT GTG GAG GAG AAC ATC ACC TTC GAG AGT CCC TTC AAT AAG 2 644 

Asn Ala Thr Val Glu Glu Asn He Thr Phe Glu Ser Pro Phe Asn Lys 
780 785 790 795 

CAA CGG TAC AAG ATG GTC ATC GAA GCC TGC TCC CTG CAG CCA GAC ATA 2 6 92 

Gin Arg Tyr Lys Met Val He Glu Ala Cys Ser Leu Gin Pro Asp He 
800 805 810 

GAC ATC CTG CCC CAT GGA GAC CAG ACT CAG ATT GGG GAA CGA GGC ATC 2 74 0 

Asp He Leu Pro His Gly Asp Gin Thr Gin He Gly Glu Arg Gly He 
815 820 825 

AAC TTG AGT ACT GGT GGT CAG CGT CCA GAT CAG TGT AGA CCC GAG CCC 2 7 88 

Asn Leu Ser Thr Gly Gly Gin Arg Pro Asp Gin Cys Arg Pro Glu Pro 
830 835 840 

TCT ACC AGC AC A CCA ATG ATT GTC TTT TTG GAT GAC CCT TTC TCG GCT 2836 
Ser Thr Ser Thr Pro Met He Val Phe Leu Asp Asp Pro- Phe Ser Ala 
845 850 " 855 

CTG GAT GTC CAT CTG AGT GAC CAC CTA ATG CAG GCT GGC ATC CTC GAG 2 8 84 

Leu Asp Val His Leu Ser Asp His Leu Met Gin Ala Gly He Leu Glu 
860 865 870 875 
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CTG CTC CGG GAT GAC AAG AGG ACA GTG GTC TTG GTG ACC CAC AAG CTA 2 932 

Leu Leu Arg Asp Asp Lys Arg Thr val Val Leu Val Thr His Lys Leu 
880 885 890 

CAG TAG CTG CCT CAT GCT GAC TGG ATC ATT GCT ATG AAG GAT GGC ACC 2 98 0 

Gin Tyr Leu Pro His Ala Asp Trp He He Ala Met Lys Asp Gly Thr 
895 900 905 

ATT CAG AGG GAG GGG ACA CTC AAG GAC TTC CAG AGG TCT GAG TGC CAG 3 028 

He Gin Arg Glu Gly Thr Leu Lys Asp Phe Gin Arg Ser Glu Cys Gin 
910 915 920 

CTC TTT GAG CAT TGG AAG ACC CTC ATG AAC CGG CAG GAC CAA GAG CTG 3 076 

Leu Phe Glu His Trp Lys Thr Leu Met Asn Arg Gin Asp Gin Glu Leu 
925 930 935 

GAG AAG GAG ACA GTC ATG GAG AGA AAA GCC CCA GAG CCA TCT CAG GGC 3124 
Glu Lys Glu Thr Val Met Glu Arg Lys Ala Pro Glu Pro Ser Gin Gly 
940 945 950 955 

CTG CCC CGT GCC ATG TCC TCA AGA GAT GGC CTT CTG CTG GAT GAG GAT 3172 
Leu Pro Arg Ala Met Ser Ser Arg Asp Gly Leu Leu Leu Asp Glu Asp 
960 965 970 

GAG GAG GAA GAG GAG GCA GCC GAG AGC GAG GAA GAT GAC AAC TTA TCC 3220 
Glu Glu Glu Glu Glu Ala Ala Glu Ser Glu Glu Asp Asp Asn Leu Ser 
975 980 985 

TCT GTG CTG CAT CAG CGA GCC AAG ATC CCA TGG CGA GCC TGC ACC AAG 32 68 

Ser Val Leu His Gin Arg Ala Lys He Pro Trp Arg Ala Cys Thr Lys 
990 995 1000 

TAT TTG TCC TCT GCT GGC ATC CTG CTC CTG TCC CTG CTT GTC TTC TCC 3316 
Tyr Leu Ser Ser Ala Gly He Leu Leu Leu Ser Leu Leu Val Phe Ser 
1005 1010 1015 

CAG CTG CTC AAG CAC ATG GTC TTG GTG GCC ATT GAC TAC TGG CTG GCC 33 64 

Gin Leu Leu Lys His Met Val Leu Val Ala lie Asp Tyr Trp Leu Ala 
1020 1025 1030 1035 

AAG TGG ACG GAC AGT GCC CTG GTC CTG AGC CCC GCC GCC AGG AAC TGC 3412 
Lys Trp Thr Asp Ser Ala Leu Val Leu Ser Pro Ala Ala Arg Asn Cys 
1040 1045 1050 

TCC CTC AGC CAG GAA TGT GCC CTG GAC CAA TCT GTC TAT GCC ATG GTA 34 60 

Ser Leu Ser Gin Glu Cys Ala Leu Asp Gin Ser Val Tyr Ala Met Val 
1055 1060 1065 

TTC ACC GTG CTC TGC AGC CTG GGT ATC GCG CTG TGC CTT GTC ACC TCT 3 5 08 

Phe Thr Val Leu Cys Ser Leu Gly He Ala Leu Cys Leu Val Thr Ser 
1070 1075 1080 

GTC ACT GTG GAG TGG ACG GGA CTG AAG GTG GCC AAG AGG CTG CAT CGC 3 556 

'Val Thr Val Glu Trp Thr Gly Leu Lys Val Ala Lys Arg Leu His Arg 
1085 1090 1095 

AGC CTG CTC AAC CGT ATC ATC CTG GCT CCC ATG AGG TTC TTT GAG ACC 3 6 04 

Ser Leu Leu Asn Arg He He Leu Ala Pro Met Arg Phe Phe Glu Thr 
1100 1105 HIO 1115 

ACG CCC CTG GGG AGT ATC CTG AAC AGA TTT TCA TCT GAC TGT AAC ACC 3 652 

Thr Pro Leu Gly Ser He Leu Asn Arg Phe Ser Ser Asp Cys Asn Thr 
1120 H25 1130 

ATT GAC CAG CAT ATC CCG TCC ACG CTG GAG TGC CTG AGC AGA TCC ACC 370 0 

He Asp Gin His He Pro Ser Thr Leu Glu Cys Leu Ser Arg Ser Thr 
1135 H40 1145 
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TTA CTC TGT GTC TCC GCC CTG GCT GTC ATC TCC TAC GTC ACG CCT GTG 3 748 

Leu Leu Cys Val Ser Ala Leu Ala Val He Ser Tyr Val Thr Pro Val 

li50 1155 1160 

TTC CTA GTG GCC CTC TTA CCC CTC GCC GTC GTG TGC TAC TTC ATC CAG 3 7 96 

Phe Leu Val Ala Leu Leu Pro Leu Ala Val Val Cys Tvr Phe lie Gin 
1165 1170 1175 

AAG TAC TTC CGA GTG GCG TCC AGG GAC CTG CAG CAG CTG GAC GAC ACA 3 844 

Lys Tyr Phe Arg Val Ala Ser Arg Asp Leu Gin Gin Leu Asp Asp Thr 
1180 1185 1190 1195 

ACA CAG CTC CCT CTG CTC TCA CAC TTT GCT GAA ACT GTG GAA GGA CTC 3892 

Thr Gin Leu Pro Leu Leu Ser His Phe Ala Glu Thr Val Glu Gly Leu 
1200 1205 1210 

ACC ACC ATC CGT GCC TTC AGG TAC GAG GCC CGG TTC CAG CAG AAG CTC 3 94 0 

Thr Thr He Arg Ala Phe Arg Tyr Glu Ala Arg Phe Gin Gin Lys Leu 
1215 1220 1225 

CTA GAG TAC ACC GAC TCC AAC AAC ATT GCC TCT CTC TTC CTC ACA GCA 3 988 

Leu Glu Tyr Thr Asp Ser Asn Asn He Ala Ser Leu Phe Leu Thr Ala 

1230 1235 1240 

GCC AAC AGG TGG CTG GAA GTC CGC ATG GAG TAC ATC GGA GCA TGC GTG 4 036 

Ala Asn Arg Trp Leu Glu Val Arg Met Glu Tyr He Gly Ala Cys Val 
1245 1250 1255 

GTA CTC ATC GCC GCT GCC ACC TCC ATC TCC AAC TCC CTA CAC AGG GAG 4 084 

Val Leu He Ala Ala Ala Thr Ser He Ser Asn Ser Leu His Arg Glu 
1260 1265 1270 1275 

CTC TCA GCC GGC CTA GTA GGC CTG GGC CTC ACC TAT GCC TTG ATG GTC 413 2 

Leu Ser Ala Gly Leu Val Gly Leu Gly Leu Thr Tyr Ala Leu Met Val 
1280 1285 1290 

TCC AAC TAC CTC AAC TGG ATG GTG AGG AAC CTG GCA GAC ATG GAG ATC 418 0 

Ser Asn Tyr Leu Asn Trp Met Val Arg Asn Leu Ala Asp Met Glu He 
1295 1300 1305 

CAA CTG GGA GCT GTG AAG GGT ATC CAC ACA CTC CTG AAA ACT GAG GCA 422 8 

Gin Leu Gly Ala Val Lys Gly He His Thr Leu Leu Lys Thr Glu Ala 

1310 1315 1320 

GAG AGC TAT GAG GGG CTC CTG GCA CCA TCG CTG ATC CCC AAG AAC TGG 42 76 

Glu Ser Tyr Glu Gly Leu Leu Ala Pro Ser Leu lie Pro Lys Asn Trp 
1325 1330 1335 

CCA GAC CAA GGG AAG ATC CAA ATT CAA AAC CTG AGT GTA CGC TAT GAC 4 3 24 

Pro Asp Gin Gly Lys He Gin He Gin Asn Leu Ser Val Arg Tyr Asp 
1340 1345 1350 1355 

AGC TCC CTG AAG CCC GTG CTG AAG CAC GTC AAC GCC CTC ATC TCC CCA 4 3 72 

Ser Ser Leu Lys Pro Val Leu Lys His Val Asn Ala Leu He Ser Pro 
1360 1365 1370 

GGA CAG AAG ATT GGG ATC TGC GGC CGC ACA GGC AGT GGA AAA TCC TCC 44 2 0 

Gly Gin Lvs He Gly He Cys Gly Arg Thr Gly Ser Gly Lys Ser Ser 
1375 1380 1385 

TTC TCT CTC GCC TTT TTC CGA ATG GTG GAT ATG TTT GAA GGG CGT ATC 44 6 8 

Phe Ser Leu Ala Phe Phe Arg Met Val Asp Met Phe Glu Gly Arg He 

1390 1395 1400 

ATC ATC GAT GGC ATT GAC ATC GCC AAG CTG CCG CTG CAC ACG CTC GGC 4 516 

He He Asp Gly He Asp He Ala Lys Leu Pro Leu His Thr Leu Gly 
1405 " ' 1^10 ' 1415 
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TCA CGC CTG TCT ATC ATC CTA CAG GAC CCT GTT CTC TTC AGT GGT ACC 4 5 64 

Ser Arg Leu Ser He He Leu Gin Asp Pro Val Leu Phe Ser Gly Thr 
1420 1425 1430 1435 

ATC AGA TTC AAC CTG GAC CCA GAG AAG AAA TGC TCA GAC AGC ACG CTG 4 612 

He Arg Phe Asn Leu Asp Pro Glu Lys Lys Cys Ser Asp Ser Thr Leu 
1440 1445 1450 

TGG GAG GCT CTG GAG ATC GCT CAG CTG AAG CTG GTG GTG AAG GCC CTG 4660 
Trp Glu Ala Leu Glu He Ala Gin Leu Lys Leu Val Val Lys Ala Leu 
1455 1460 1465 

CCA GGA GGC CTG GAT GCC ATC ATC ACG GAA GGA GGG GAG AAT TTT AGC 47 08 

Pro Gly Gly Leu Asp Ala He He Thr Glu Gly Gly Glu Asn Phe Ser 
1470 1475 1480 

CAG GGC CAG AGG CAG CTG TTC TGC CTG GCC CGG GCC TTT GTG AGG AAG 4 7 56 

Gin Gly Gin Arg Gin Leu Phe Cys Leu Ala Arg Ala Phe Val Arg Lys 
1485 1490 1495 

ACC AGC ATC TTC ATC ATG GAT GAA GCA ACT GCC TCC ATC GAC ATG GCT 48 04 

Thr Ser He Phe He Met Asp Glu Ala Thr Ala Ser He Asp Met Ala 
1500 1505 1510 1515 

ACG GAA AAT ATC CTC CAG AAG GTG GTG ATG ACA GCC TTC GCA GAC CGC 48 52 

Thr Glu Asn He Leu Gin Lys Val Val Met Thr Ala Phe Ala Asp Arg 
1520 1525 1530 

ACC GTG GTC ACC ATC GCG CAC CGC GTG CAC ACC ATC CTG AGT GCA GAC 4 9 00 

Thr Val Val Thr He Ala His Arg Val His Thr He Leu Ser Ala Asp 
1535 1540 1545 

CTA GTG ATG GTC CTG AAG AGG GGC GCG ATC CTG GAG TTC GAC AAG CCG 4 94 8 

Leu Val Met Val Leu Lys Arg Gly Ala He Leu Glu Phe Asp Lys Pro 
1550 1555 1560 

GAA AAG CTT CTC AGC CAG AAG GAC AGC GTC TTT GCC TCC TTT GTC CGC 4 9 96 

Glu Lys Leu Leu Ser Gin Lys Asp Ser Val Phe Ala Ser Phe Val Arg 
1565 1570 1575 

GCG GAC AA ATGAC CAG CC AGCGCCAAAG TGCCACCCCA CACCTCACCT GCTTGCCATG 5 054 

Ala Asp 

1580 

GATTTCTTAC TGTAAATCAC TTGTAAATAA AGAAACTAAT TCTTTGCTAA AAAAAA 5110 

(33) INFORMATION FOR SEQ ID NO -.34: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1581 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Met Pro Leu Ala Phe Cys Gly Thr Glu Asn His Ser Ala Ala Tyr Arg 
15 10 15 

Val Asp Gin Gly Val Leu Asn Asn Gly Cys Phe Val Asp Ala Leu Asn 
20 25 30 

Val Val Pro His Val Phe Leu Leu Phe He Thr Phe Pro He Leu Phe 
35 40 45 
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lie Gly Trp Gly Ser Gin Ser Ser Lys Val His lie His His Ser Thr 
50 55 ' 60 

Trp Leu His Phe Pro Gly His Asn Leu Arg Trp lie Leu Thr Phe lie 
65 70 75 80 

Leu Leu Phe Val Leu Val Cys Glu lie Ala Glu Gly lie Leu Ser Asp 
85 90 95 

Gly Val Thr Glu Ser Arg His Leu His Leu Tyr Met Pro Ala Gly Met 
100 105 110 

Ala Phe Met Ala Ala lie Thr Ser Val Val Tyr Tyr His Asn lie Glu 

115 120 125 

Thr Ser Asn Phe Pro Lys Leu Leu lie Ala Leu Leu lie Tyr Trp Thr 
130 135 140 

Leu Ala Phe He Thr Lys Thr He Lys Phe Val Lys Phe Tyr Asp His 
145 150 * 155 160 

Ala He Gly Phe Ser Gin Leu Arg Phe Cys Leu Thr Gly Leu Leu Val 
165 170 175 

lie Leu Tyr Gly Met Leu Leu Leu Val Glu Val Asn Val He Arg Val 
180 185 190 

Arg Arg Tyr Val Phe Phe Lys Thr Pro Arg Glu Val Lys Pro Pro Glu 

195 200 205 

Asp Leu Gin Asp Leu Gly Val Arg Phe Leu Gin Pro Phe Val Asn Leu 
210 215 220 

Leu Ser Lys Gly Thr Tyr Trp Trp Met Asn Ala Phe He Lys Thr Ala 
22S 230 235 240 

His Lys Lys Pro lie Asp Leu Arg Ala He Gly Lys Leu Pro He Ala 
245 250 255 

Met Arg Ala Leu Thr Asn Tyr Gin Arg Leu Cys Leu Ala Phe Asp Ala 
260 265 270 

Gin Ala Arg Lys Asp Thr Gin Ser Gin Gin Gly Ala Arg Ala lie Trp 

275 28 0 285 

Arg Ala Leu Cys His Ala Phe Gly Arg Arg Leu Val Leu Ser Ser Thr 
290 295 300 

Phe Arg lie Leu Ala Asp Leu Leu Gly Phe Ala Gly Pro Leu Cys lie 
305 310 315 320 

Phe Gly He Val Asp His Leu Gly Lys Glu Asn His val Phe Gin Pro 
325 * 330 335 

Lys Thr Gin Phe Leu Gly Val Tyr Phe Val Ser Ser Gin Glu Phe Leu 
340 345 350 

Gly Asn Ala Tyr Val Leu Ala Val Leu Leu Phe Leu Ala Leu Leu Leu 

355 360 365 

Gin Arg Thr Phe Leu Gin Ala Ser Tyr Tyr Val Ala He Glu Thr Gly 
370 375 380 

lie Asn Leu Arg Gly Ala lie Gin Thr Lys lie Tyr Asn Lys lie Met 
385 390 395 400 

His Leu Ser Thr Ser Asn Leu Ser Met Gly Glu t-'ez Thr Ala Gly Gin 
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405 410 415 

lie Cys Asn Leu Val Ala He Asp Thr Asn Gin Leu Met Trp Phe Phe 
420 425 430 

Phe Leu Cys Pro Asn Leu Trp Ala Met Pro Val Gin He He Val Gly 
435 440 445 

Val He Leu Leu Tyr Tyr He Leu Gly Val Ser Ala Leu He Gly Ala 
450 455 460 

Ala Val He lie Leu Leu Ala Pro Val Gin Tyr Phe Val Ala Thr Lys 
465 470 475 480 

Leu Ser Gin Ala Gin Arg Thr Thr Leu Glu Tyr Ser Asn Glu Arg Leu 
485 490 495 

Lys Gin Thr Asn Glu Met Leu Arg Gly He Lys Leu Leu Lys Leu Tyr 
500 505 510 

Ala Trp Glu Asn He Phe Cys Ser Arg Val Glu Lys Thr Arg Arg Lys 
515 520 525 

Glu Met Thr Ser Leu Arg Ala Phe Ala Val Tyr Thr Ser He Ser He 
530 535 540 

Phe Met Asn Thr Ala He Pro He Ala Ala Val Leu He Thr Phe Val 
545 550 555 560 

Gly His Val Ser Phe Phe Lys Glu Ser Asp Phe Ser Pro Ser Val Ala 
565 570 575 

Phe Ala Ser Leu Ser Leu Phe His He Leu Val Thr Pro Leu Phe Leu 
580 585 590 

Leu Ser Ser Val Val Arg Ser Thr Val Lys Ala Leu Val Ser Val Gin 
595 600 605 

Lys Leu Ser Glu Phe Leu Ser Ser Ala Glu lie Arg Glu Glu Gin Cys 
610 615 620 

Ala Pro Arg Glu Pro Ala Pro Gin Gly Gin Ala Gly Lys Tyr Gin Ala 
625 630 635 640 

Val Pro Leu Lys Val Val Asn Arg Lys Arg. Pro Ala Arg Glu Glu Val 
645 650 655 

Arg Asp Leu Leu Gly Pro Leu Gin Arg Leu Thr Pro Ser Thr Asp Gly 
660 665 670 

Asp Ala Asp Asn Phe Cys Val Gin He He Gly Gly Phe Phe Thr Trp 
675 680 685 

Thr Pro Asp Gly He Pro Thr Leu Ser Asn lie Thr He Arg He Pro 
690 695 700 

Arg Gly Gin Leu Thr Met He Val Gly Gin Val Gly Cys Gly Lys Ser 
705 710 715 720 

Ser Leu Leu Leu Ala Thr Leu Gly Glu Met Gin Lys Val Ser Gly Ala 
725 730 735 

Val Phe Trp Asn Ser Leu Pro Asp Ser Glu Gly Arg Arg Pro Gin Gin 
740 745 750 

Pro Arg Ala Gly Asp Ser Gly Arg Phe Gly Cys Gin Glu Gin Arg Pro 
755 760 765 
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Cys Gly Tyr Ala Ser Gin Lys Pro Trp Leu Leu Asn Ala Thr Val Glu 
770 775 780 

Glu Asn lie Thr Phe Glu Ser Pro Phe Asn Lys Gin Arg Tyr Lys Met 
785 790 795 800 

Val lie Glu Ala Cys Ser Leu Gin Pro Asp lie Asp lie Leu Pro His 
805 810 815 

Gly Asp Gin Thr Gin lie Gly Glu Arg Gly lie Asn Leu Ser Thr Gly 
820 825 830 

Gly Gin Arg Pro Asp Gin Cys Arg Pro Glu Pro Ser Thr Ser Thr Pro 
835 840 845 

Met lie Val Phe Leu Asp Asd Pro Phe Ser Ala Leu Asp Val His Leu 
850 855 860 

Ser Asp His Leu Met Gin Ala Gly lie Leu Glu Leu Leu Arg Asp Asp 
865 870 875 880 

Lys Arg Thr Val Val Leu Val Thr His Lys Leu Gin Tyr Leu Pro His 
885 890 895 

Ala Asp Trp He lie Ala Met Lys Asp Gly Thr He Gin Arg Glu Gly 
900 905 910 

Thr Leu Lys Asp Phe Gin Arg Ser Glu Cys Gin Leu Phe Glu His Trp 
915 920 925 

Lys Thr Leu Met Asn Arg Gin Asp Gin Glu Leu Glu Lys Glu Thr Val 
930 935 940 

Met Glu Arg Lys Ala Pro Glu Pro Ser Gin Gly Leu Pro Arg Ala Met 
945 950 955 960 

Ser Ser Arg Asp Gly Leu Leu Leu Asp Glu Asp Glu Glu Glu Glu Glu 
965 970 975 

Ala Ala Glu Ser Glu Glu Asp Asp Asn Leu Ser Ser Val Leu His Gin 
980 985 990 

Arg Ala Lys He Pro Trp Arg Ala Cys Thr Lys Tyr Leu Ser Ser Ala 
995 1000 1005 

Gly He Leu Leu Leu Ser Leu Leu Val Phe Ser Gin Leu Leu Lys His 
1010 1015 1020 

Met Val Leu Val Ala He Asp Tyr Trp Leu Ala Lys Trp Thr Asp Ser 
1025 1030 1035 1040 

Ala Leu Val Leu Ser Pro Ala Ala Arg Asn Cys Ser Leu Ser Gin Glu 
1045 1050 1055 

Cys Ala Leu Asp Gin Ser Val Tyr Ala Met Val Phe Thr Val Leu Cys 
1060 1065 1070 

Ser Leu Gly He Ala Leu Cys Leu Val Thr Ser Val Thr Val Glu Trp 
1075 ' 1080 1085 

Thr Gly Leu Lys Val Ala Lys Arg Leu His Arg Ser Leu Leu Asn Arg 
1090 1095 ~ 1100 

He lie Leu Ala Pro Met Arg Phe Phe Glu Thr Thr Pro Leu Gly Ser 
1105 1110 " 1115 1120 

lie Leu Asn Arg Phe Ser Ser Asp Cys Asn Thr lie Asp Gin His He 
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1125 1130 1135 

Pro Ser Thr Leu Glu Cys Leu Ser Arg Ser Thr Leu Leu Cys Val Ser 
1140 1145 1150 

Ala Leu Ala Val lie Ser Tyr Val Thr Pro Val Phe Leu Val Ala Leu 
1155 1160 1165 

Leu Pro Leu Ala Val Val Cys Tyr Phe lie Gin Lys Tyr Phe Arg Val 
1170 1175 1180 

Ala Ser Arg Asp Leu Gin Gin Leu Asp Asp Thr Thr Gin Leu Pro Leu 
1185 1190 1195 1200 

Leu Ser His Phe Ala Glu Thr Val Glu Gly Leu Thr Thr lie Arg Ala 
1205 1210 1215 

Phe Arg Tyr Glu Ala Arg Phe Gin Gin Lys Leu Leu Glu Tyr Thr Asp 
1220 1225 1230 

Ser Asn Asn lie Ala Ser Leu Phe Leu Thr Ala Ala Asn Arg Trp Leu 
1235 1240 1245 

Glu Val Arg Met Glu Tyr He Gly Ala Cys Val Val Leu lie Ala Ala 
1250 1255 1260 

Ala Thr Ser He Ser Asn Ser Leu His Arg Glu Leu Ser Ala Gly Leu 
1265 1270 1275 1280 

Val Gly Leu Gly Leu Thr Tyr Ala Leu Met Val Ser Asn Tyr Leu Asn 
1285 1290 1295 

Trp Met Val Arg Asn Leu Ala Asp Met Glu He Gin Leu Gly Ala Val 
1300 1305 1310 

Lys Gly He His Thr Leu Leu Lys Thr Glu Ala Glu Ser Tyr Glu Gly 
1315 1320 1325 

Leu Leu Ala Pro Ser Leu He Pro Lys Asn Trp Pro Asp Gin Gly Lys 
1330 1335 1340 

He Gin He Gin Asn Leu Ser Val Arg Tyr Asp Ser Ser Leu Lys Pro 
1345 1350 1355 1360 

Val Leu Lys His Val Asn Ala Leu lie Ser Pro Gly Gin Lys lie Gly 
1365 1370 1375 

lie Cys Gly Arg Thr Gly Ser Gly Lys Ser Ser Phe Ser Leu Ala Phe 
1380 1385 1390 

Phe Arg Met Val Asp Met Phe Glu Gly Arg He He He Asp Gly lie 
1395 1400 1405 

Asp He Ala Lys Leu Pro Leu His Thr Leu Gly Ser Arg Leu Ser lie 
1410 1415 1420 

lie Leu Gin Asp Pro Val Leu Phe Ser Gly Thr lie Arg Phe Asn Leu 
1425 1430 1435 1440 

Asp Pro Glu Lvs Lys Cys Ser Asp Ser Thr Leu Trp Glu Ala Leu Glu 
1445 * 1450 1455 

He Ala Gin Leu Lys Leu Val Val Lys Ala Leu Pro Gly Gly Leu Asp 
1460 1465 1470 

Ala lie He Thr Glu Gly Gly Glu Asn Phe Ser Gin Gly Gin Arg Gin 
1475 1480 1485 
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Leu Phe Cys Leu Ala Arg Ala Phe Val Arg Lys Thr Ser He Phe He 
1490 1495 1500 

Met Asp Glu Ala Thr Ala Ser He Asp Met Ala Thr Glu Asn lie Leu 
1505 1510 1515 1520 

-Gin Lys Val Val Met Thr Ala Phe Ala Asp Arg Thr Val Val Thr He 
1525 1530 1535 

Ala His Arg Val His Thr He Leu Ser Ala Asp Leu Val Met Val Leu 
1540 1545 1550 

Lys Arg Gly Ala He Leu Glu Phe Asp Lys Pro Glu Lys Leu Leu Ser 
1555 1560 1565 

Gin Lys Asp Ser Val Phe Ala Ser Phe Val Arg Ala Asp 
1570 1575 1580 

{34) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4877 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 



AGCCGAGCCC 


GTGCGCGCGC 


CGCCATGCCC 


TTGGCCTTCT 


GCGGTACCGA 


GAAC C ACTCG 


60 


GCCGCCTACC 


GGGTGGACCA 


GGGCGTCCTC 


AACAACGGCT 


GCTTCGTGGA 


CGCGCTCAAC 


120 


GTGGTGCCGC 


ACGTTTTCCT 


GCTCTTCATC 


ACCTTCCCCA 


TCCTCTTCAT 


CGGATGGGGC 


180 


AGCCAGAGCT 


CCAAGGTGCA 


CATCCACCAC 


AGCACCTGGC 


TGCACTTTCC 


AGGGCACAAC 


2 40 


CTGCGCTGGA 


TCCTTACCTT 


CATTTTGCTC 


TTCGTCCTTG 


TG T G TG AG AT 


CGCTGAGGGC 


300 


ATCCTGTCTG 


ATGGGGTGAC 


AGAATCCCGC 


CACCTCCACC 


TGTACATGCC 


AGCCGGGATG 


360 


GCGTTCATGG 


CTGCCATCAC 


CTCTGTAGTC 


TACTATCATA 


AC AT CG AG AC 


CTCCAACTTC 


420 


CCCAAGCTTT 


TGATCGCTCT 


GCTCATCTAT 


TGGACCCTGG 


CCTTCATCAC 


G AAG AC CAT C 


480 


AAGTTTGTCA 


AGTTCTATGA 


CCACGCCATC 


GGCTTCTCCC 


AGCTGCGCTT 


CTGCCTCACG 


540 


GGGCTTCTGG 


TGATCCTGTA 


TGGGATGTTG 


CTGCTTGTGG 


AGGTCAACGT 


CAT C AG AG TG 


600 


AGGAGGTACA 


TCTTCTTCAA 


GACGCCACGG 


GAGGTGAAGC 


CCCCTGAGGA 


CCTGCAGGAC 


660 


CTGGGTGTGC 


GCTTTCTGCA 


GCCCTTCGTT 


AACCTGCTGT 


CAAAGGGGAC 


CTATTGGTGG 


720 


ATGAATGCCT 


TCATCAAGAC 


GGCCCACAAG 


AAGCCCATCG 


ACCTGCGGGC 


CATCGCGAAG 


780 


CTGCCCATCG 


CCATGAGAGC 


CCTCACCAAC 


TATCAGCGCC 


TCTGCGTGGC 


CTTCGATGCT 


840 


CAGGCGCGGA 


AGGACACACA 


GAGCCCACAG 


GGTGCCCGGG 


CCATCTGGAG 


GGCTCTATGC 


900 


CATGCCTTTG 


GGAGACGCCT 


GATCCTCAGC 


AGCACATTCC 


GCATCCTGGC 


TGACCTGTTG 


960 
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GGCTTCGCTG 


GACCACTCTG 


CATCTTTGGG 


ATCGTGGACC 


ACCTGGGGAA 


GGAGAACCAC 


1020 


GTCTTCCAGC 


C C AAG AC AC A 


GTTTCTCGGG 


GTTTACTTCG 


TCTCTTCTCA 


AGAGTTCCTT 


1080 


GGCAATGCCT 


ACGTCTTGGC 


CGTGCTTCTG 


TTCCTTGCCC 


TGCTACTGCA 


AAGGACATTC 


1140 


CTGCAAGCCT 


CATACTACGT 


CGCCATTGAA 


ACTGGAATTA 


ACCTGAGAGG 


AGCAATCCAG 


1200 


ACCAAGATTT 


ACAATAAAAT 


CATGCACATG 


TCCACCTCCA 


ACCTGTCAAT 


GGGGGAAATG 


1260 


ACTGCTGGGC 


AGATCTGCAA 


CCTGGTGGCC 


ATCGACACAA 


ACCAGCTCAT 


GTGGTTCTTC 


1320 


TTTCTGTGCC 


CAAACCTCTG 


GACGATGCCA 


GTACAGATCA 


TTGTGGGCGT 


GAT C CTTCT C 


1380 


TACTACATCC 


TTGGGGTCAG 


TGCCTTGATT 


GGAGCAGCTG 


TCATCATTCT 


GCTGGCTCCT 


1440 


GT AC AG TACT 


TTGTGGCCAC 


CAAGCTCTCC 


CAGGCACAGC 


GGACGACCTT 


GGAGCACTCC 


1500 


AACGAGAGGC 


TGAAGCAGAC 


CAACGAGATG 


CTCCGGGGCA 


TGAAGCTGCT 


CAAACTGTAT 


1560 


GCGTGGGAGA 


GCATCTTCTG 


CTCCAGGGTG 


GAGGTGACTC 


GCAGGAAGGA 


GATGACCAGC 


1620 


CTGAGGGCGT 


TTGCTGTCTA 


CACTTCCATC 


TCCATCTTCA 


TGAACACAGC 


CATCCCCATT 


1680 


GCTGCCGTCC 


TCATCACCTT 


CGTGGGCCAC 


GTCAGCTTCT 


TCAAAGAGTC 


GGACTTGTCA 


1740 


CCCTCGGTGG 


CCTTTGCCTC 


CCTCTCTCTC 


TTCCACATCC 


TGGTCACTCC 


ACTGTTCCTG 


1800 


CTGTCTAGCG 


TGGTTCGGTC 


C A C TG TC AAA 


GCCCTGGTGA 


GCGTGCAAAA 


ACTGAGCGAG 


1860 


TTCCTGTCTA 


GTGCAOAGAT 


CCGTGAGGAG 


CAGTGTGCCC 


CCCGAGAGCC 


TGCACCCCAA 


1920 


GGCCAAGCCG 


GCAAGTACCA 


GGCAGTGCCC 


CTCAAGGTTG 


TGAACCGCAA 


ACGCCCAGCC 


1980 


CGGGAAGAGG 


TCCGGGACCT 


CCTGGGCCCA 


CTGCAGAGGC 


TGGCCCCTAG 


CATGGACGGG 


2040 


GATGCTGACA 


ACTTCTGTGT 


C CAG AT CATC 


GGAGG CTTCT 


TCACCTGGAC 


CCCTGATGGA 


2100 


ATCCCCACTC 


TGTCCAACAT 


CACCATCCGT 


ATTCCCCGAG 


GTCAGCTAAC 


CATGATTGTG 


2160 


GGGCAGGTGG 


GCTGCGGCAA 


GTCCTCGCTC 


CTCCTCGCCA 


CCCTGGGGGA 


GATGCAGAAG 


2220 


GTGTCGGGGG 


CCGTCTTCTG 


GAACAGCAAC 


CTTCCGGACA 


GCGAGGGGAG 


AG G AC C C CAG 


2280 


C AG C C C AG AG 


CGGGAGACAG 


CAGCTGGCTC 


GGATATCAGG 


AGCAGAGGCC 


CCGTGGCTAC 


2340 


G CAT CT C AG A 


AACCATGGCT 


GCTAAACGCC 


ACCGTGGAAG 


AGAACATCAC 


CTTCGAGAGT 


2400 


CCCTTCAATC 


CGCAGCGGTA 


CAAGATGGTC 


ATCGAAGCCT 


GCTCCCTGCA 


GCCGGACATA 


2460 


GACATCCTGC 


CCCACGGAGA 


CCAGACTCAG 


ATTGGGGAAC 


GGGGCATCAA 


CCTGTCTGGT 


2520 


GGTCAGCGTC 


C AG AT CAG TG 


TG G T C CAG AG 


CC CTCTACCA 


GCAGACCAAT 


GTTCGTCTTC 


2580 


TTGGATGACC 


CCTTCTCAGC 


TTTGGATGTC 


CATCTGAGTG 


ACCACCTGAT 


GCAGGCCGGC 


2640 


ATCCTTGAGC 


TGCTCCGGGA 


TGACAAGAGG 


ACAGTGGTCT 


TGGTGACCCA 


CAAGCTACAG 


2700 


TATCTGCCTC 


ATGCAGACTG 


GATCATTGCC 


ATGAAGGATG 


GGACCATTCA 


GAGGGAAGGG 


2760 


ACGCTCAAGG 


ACTTCCAGAG 


GTCCGAGTGC 


CAGCTCTTTG 


AG C AC TG G AA 


GACCCTCATG 


2820 


AACCGGCAGG 


ACCAAGAGCT 


GGAGAAGGAG 


ACAGTCATGG 


AGAGGAAAGC 


CTCAGAGCCA 


2880 


TCTCAGGGCC 


TGCCCCGTGC 


CATGTCCTCC 


AGAGACGGCC 


TTCTGCTGGA 


TGAGGAAGAG 


2940 


GAGGAAGAGG 


AGGCAGCCGA 


AAGCGAGGAA 


G AT G AC AAC T 


TATCTTCAGT 


GCTGCATCAG 


3000 
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CGAGCTAAGA 


TCCCCTGGCG 


AGCCTGCACT 


AAGTATCTGT 


CCTCTGCTGG 


CATTCTGCTC 


3060 


CTGTCCCTGC 


TTGTCTTCTC 


CCAGCTGCTC 


AAGCACATGG 


TCTTGGTGGC 


C AT TG AT TAT 


3120 


TGGCTGGCCA 


AGTGGACGGA 


CAGTGCCCTG 


GTCCTGAGCC 


CCGCTGCCAG 


GAACTGTTCG 


3180 


CTCAGCCAGG 


AATGTGACCT 


GGACCAGTCT 


GTCTATGCCA 


TGGTATTCAC 


CTTGCTCTGC 


3240 


AGCCTGGGTA 


TCGTGCTGTG 


CCTGGTCACC 


TCTGTCACTG 


TGGAGTGGAC 


GGGACTGAAG 


3300 


GTGGCCAAGA 


GGCTACACCG 


CAGCCTGCTC 


AACCGCATCA 


TCCTGGCCCC 


CATGAGGTTC 


3360 


TTTGAGACCA 


CACCCCTCGG 


GAG TAT C C TG 


AACAGATTTT 


CATCCGACTG 


TAACACCATT 


3420 


GACCAGCACA 


TCCCATCCAC 


GCTGGAGTGT 


CTGAGCCGGT 


CCACCCTGCT 


GTGTGTCTCC 


3480 


GCCCTGACTG 


TCATCTCCTA 


TGTCACACCC 


GTGTTCCTCG 


TGGCCCTCTT 


ACCCCTAGCT 


3540 


GTTGTGTGCT 


ACTTCATTCA 


GAAGTACTTC 


CGAGTGGCAT 


CCAGGGACCT 


GCAGCAGCTG 


3600 


GACGACACGA 


CGCAGCTCCC 


GCTCGTCTCA 


CACTTTGCTG 


AAACTGTGGA 


GGGACTCACC 


3660 


ACCATCCGTG 


CCTTCAGGTA 


CGAGGCCCGG 


TT C C AG C AG A 


AGCTTCTAGA 


ATATACCGAC 


3720 


TCCAACAACA 


TCGCCTCCCT 


CTTCCTCACG 


GCAGCCAACA 


GATGGCTGGA 


AGTCTGCATG 


3780 


GAG T AC AT CG 


GAGCGTGCGT 


GGTACTCATT 


GCGGCTGCCA 


CCTCCATCTC 


CAACTCCCTG 


3840 


CACAGGGAAC 


TTTCTGCTGG 


CCTGGTGGGC 


CTGGGCCTCA 


CCTATGCCTT 


GATGGTCTCC 


3900 


AACTACCTCA 


ACTGGATGGT 


GAGGAACCTG 


GCGGACATGG 


AGATCCAGCT 


GGGGG CTGTG 


3960 


AAGAGGATCC 


ACGCACTCCT 


GAAAAC CG AG 


GCGGAGAGCT 


ATGAGGGGCT 


CCTGGCGCCG 


4020 


TCGTTGATCC 


CCAAGAACTG 


GCCAGACCAA 


GGGAAGATCC 


AAATTCAGAA 


CCTGAGCGTG 


4080 


CGCTATGACA 


GCTCCCTGAA 


GCCAGTGCTG 


AAGCATGTCA 


ACACCCTCAT 


CTCCCCGGGG 


4140 


CAGAAGATCG 


GGATCTGCGG 


CCGCACAGGC 


A3CGGGAAGT 


CCTCCTTCTC 


CCTGGCCTTT 


4200 


TTCCGAATGG 


TGGACATGTT 


TGAAGGACGC 


AT CAT CAT TG 


ATGGCATCGA 


CATCGCCAAG 


4260 


CTGCCACTTC 


ACACGCTGCG 


CTCACGCCTG 


TCCATCATCC 


T AC AG G AC C C 


CGTCCTCTTC 


4320 


AGCGGCACGA 


TCAGATTCAA 


CCTGGACCCC 


GAGAAGAAAT 


GCTCAGACAG 


CACACTGTGG 


4380 


GAGGCCCTGG 


AGATCGCCCA 


GCTGAAGCTG 


GTAGTGAAGG 


CACTGCCAGG 


AGGCCTAGAT 


4440 


G C CAT CAT C A 


CAGAAGGAGG 


GGAGAATTTT 


AGCCAGGGCC 


AGAGGCAGCT 


GTTCTGCCTG 


4500 


GCCCGGGCCT 


TCGTGAGGAA 


GACCAGCATC 


TT CAT CAT GG 


ATGAAG CAAC 


CGCCTCCATC 


4560 


GACATGGCTA 


CGGAGAACAT 


CCTCCAGAAG 


GTGGTGATGA 


CAGCCTTCGC 


AGACCGCACG 


4620 


GTGGTCACCA 


TCGCGCATCG 


TGTGCACACC 


ATCCTGAGTG 


CAGACCTGGT 


GATGGTCCTC 


4680 


AAGAGGGGTG 


CTATCCTGGA 


GTTTGACAAG 


CCAGAGACGC 


TCCTCAGCCA 


GAAGGACAGC 


4740 


GTGTTCGCCT 


CCTTTGTCCG 


TGCGGACAAG 


TGACTTACCG 


GAGCCAAAGT 


GCCACCCCGC 


4800 


GCCTCGCTTG 


CTTGCCTAGG 


ATTTCTAACT 


GCAAATCACT 


TGTAAATAAA 


TTAATTCTTT 


4860 


GCTAAAAAAA 


AAAAAAA 










4877 
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(A) LENGTH: 4877 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

<iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 25,. 4770 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

AGCCGAGCCC GTGCGCGCGC CGCC ATG CCC TTG GCC TTC TGC GGT ACC GAG 51 

Met Pro Leu Ala Phe Cys Gly Thr Glu 
1 5 

AAC CAC TCG GCC GCC TAC CGG GTG GAC CAG GGC GTC CTC AAC AAC GGC 99 

Asn His Ser Ala Ala Tyr Arg Val Asp Gin Gly Val Leu Asn Asn Gly 

10 15 20 25 

TGC TTC GTG GAC GCG CTC AAC GTG GTG CCG CAC GTT TTC CTG CTC TTC 14 7 

Cys Phe Val Asp Ala Leu Asn Val Val Pro His Val Phe Leu Leu Phe 

30 35 40 

ATC ACC TTC CCC ATC CTC TTC ATC GGA TGG GGC AGC CAG AGC TCC AAG 19 5 

lie Thr Phe Pro lie Leu Phe He Gly Trp Gly Ser Gin Ser Ser Lys 

45 50 55 

GTG CAC ATC CAC CAC AGC ACC TGG CTG CAC TTT CCA GGG CAC AAC CTG 243 

Val His He His His Ser Thr Trp Leu His Phe Pro Gly His Asn Leu 

60 65 70 

CGC TGG ATC CTT ACC TTC ATT TTG CTC TTC GTC CTT GTG TGT GAG ATC 291 

Arg Trp He Leu Thr Phe He Leu Leu Phe Val Leu Val Cys Glu He 
75 80 85 

GCT GAG GGC ATC CTG TCT GAT GGG GTG ACA GAA TCC CGC CAC CTC CAC 33 9 

Ala Glu Gly He Leu Ser Asp Gly Val Thr Glu Ser Arg His Leu His 

90 95 100 105 

CTG TAC ATG CCA GCC GGG ATG GCG TTC ATG GCT GCC ATC ACC TCT GTA 38 7 

Leu Tyr Met Pro Ala Gly Met Ala Phe Met Ala Ala He Thr Ser Val 

110 115 120 

GTC TAC TAT CAT AAC ATC GAG ACC TCC AAC TTC CCC AAG CTT TTG ATC 43 5 

Val Tyr Tyr His Asn He Glu Thr Ser Asn Phe Pro Lys Leu Leu He 

125 130 135 

GCT CTG CTC ATC TAT TGG ACC CTG GCC TTC ATC ACG AAG ACC ATC AAG 48 3 

Ala Leu Leu He Tyr Trp Thr Leu Ala Phe He Thr Lys Thr lie Lys 
140 " 145 X50 

TTT GTC AAG TTC TAT GAC CAC GCC ATC GGC TTC TCC CAG CTG CGC TTC 531 

Phe Val Lvs Phe Tvr Asp His Ala He Gly Phe Ser Gin Leu Arg Phe 
155 * ' 160 " 165 

TGC CTC ACG GGG CTT CTG GTG ATC CTG TAT GGG ATG TTG CTG CTT GTG 57 9 

Cys Leu Thr Gly Leu Leu Val lie Leu Tyr Gly Met Leu Leu Leu Val 

170 175 180 185 
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GAG GTC AAC GTC ATC AGA GTG AGG AGG TAC ATC TTC TTC AAG ACG CCA 62 7 

Glu Val Asn Val lie Arg Val Arg Arg Tyr He Phe Phe Lys Thr Pro 
190 195 200 

CGG GAG GTG AAG CCC CCT GAG GAC CTG CAG GAC CTG GGT GTG CGC TTT 67 5 

Arg Glu Val Lys Pro Pro Glu Asp Leu Gin Asp Leu Gly Val Arg Phe 
205 210 215 

CTG CAG CCC TTC GTT AAC CTG CTG TCA AAG GGG ACC TAT TGG TGG ATG 723 
Leu Gin Pro Phe Val Asn Leu Leu Ser Lys Gly Thr Tyr Trp Trp Met 
220 225 230 

AAT GCC TTC ATC AAG ACG GCC CAC AAG AAG CCC ATC GAC CTG CGG GCC 7 71 

Asn Ala Phe lie Lys Thr Ala His Lys Lys Pro He Asp Leu Arg Ala 
235 240 245 

ATC GCG AAG CTG CCC ATC GCC ATG AGA GCC CTC ACC AAC TAT CAG CGC 819 
He Ala Lys Leu Pro He Ala Met Arg Ala Leu Thr Asn Tyr Gin Arg 
250 255 260 265 

CTC TGC GTG GCC TTC GAT GCT CAG GCG CGG AAG GAC ACA CAG AGC CCA 86 7 

Leu Cys Val Ala Phe Asp Ala Gin Ala Arg Lys Asp Thr Gin Ser Pro 
270 275 280 

CAG GGT GCC CGG GCC ATC TGG AGG GCT CTA TGC CAT GCC TTT GGG AGA 915 
Gin Gly Ala Arg Ala He Trp Arg Ala Leu Cys His Ala Phe Gly Arg 
285 290 295 

CGC CTG ATC CTC AGC AGC ACA TTC CGC ATC CTG GCT GAC CTG TTG GGC 963 
Arg Leu He Leu Ser Ser Thr Phe Arg He Leu Ala Asp Leu Leu Gly 
300 305 310 

TTC GCT GGA CCA CTC TGC ATC TTT GGG ATC GTG GAC CAC CTG GGG AAG 1011 
Phe Ala Gly Pro Leu Cys lie Phe Gly He Val Asp His Leu Gly Lys 
315 320 325 

GAG AAC CAC GTC TTC CAG CCC AAG ACA CAG TTT CTC GGG GTT TAC TTC 1059 
Glu Asn His Val Phe Gin Pro Lys Thr Gin Phe Leu Gly Val Tyr Phe 
330 335 340 345 

GTC TCT TCT CAA GAG TTC CTT GGC AAT GCC TAC GTC TTG GCC GTG CTT 110 7 

val Ser Ser Gin Glu Phe Leu Gly Asn Ala Tyr Val Leu Ala Val Leu 
350 355 360 

CTG TTC CTT GCC CTG CTA CTG CAA AGG ACA TTC CTG CAA GCC TCA TAC 115 5 

Leu Phe Leu Ala Leu Leu Leu Gin Arg Thr Phe Leu Gin Ala Ser Tyr 
365 370 375 

TAC GTC GCC ATT GAA ACT GGA ATT AAC CTG AGA GGA GCA ATC CAG ACC 12 03 

Tyr Val Ala He Glu Thr Gly He Asn Leu Arg Gly Ala He Gin Thr 
380 385 390 

AAG ATT TAC AAT AAA ATC ATG CAC ATG TCC ACC TCC AAC CTG TCA ATG 12 51 

Lys lie Tyr Asn Lys He Met His Met Ser Thr Ser Asn Leu Ser Met 
395 400 405 

GGG GAA ATG ACT GCT GGG CAG ATC TGC AAC CTG GTG GCC ATC GAC ACA 12 9 9 

Gly Glu Met Thr Ala Gly Gin He Cys Asn Leu Val Ala lie Asp Thr 
410 415 ' 420 425 

AAC CAG CTC ATG TGG TTC TTC TTT CTG TGC CCA AAC CTC TGG ACG ATG 134 7 

Asn Gin Leu Met Trp Phe Phe Phe Leu Cys Pro Asn Leu Trp Thr Met 
430 435 440 

CCA GTA CAG ATC ATT GTG GGC GTG ATC CTT CTC TAC TAC ATC CTT GGG 13 9 5 

Pro Val Gin He He Val Gly Val lie Leu Leu Tyr Tyr He Leu Gly 
445 450 455 
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GTC AGT GCC TTG ATT GGA GCA GCT GTC ATC ATT CTG CTG GCT CCT GTA 144 3 

Val Ser Ala Leu He Gly Ala Ala Val lie He Leu Leu Ala Pro Val 
460 465 470 

CAG TAG TTT GTG GCC ACC AAG CTC TCC CAG GCA CAG CGG ACG ACC TTG 1491 

Gin Tyr Phe Val Ala Thr Lys Leu Ser Gin Ala Gin Arg Thr Thr Leu 
475 480 485 

GAG CAC TCC AAC GAG AGG CTG AAG CAG ACC AAC GAG ATG CTC CGG GGC 1539 

Glu His Ser Asn Glu Arg Leu Lys Gin Thr Asn Glu Met Leu Arg Gly 

490 495 500 505 

ATG AAG CTG CTC AAA CTG TAT GCG TGG GAG AGC ATC TTC TGC TCC AGG 15 87 

Met Lys Leu Leu Lys Leu Tyr Ala Trp Glu Ser He Phe Cys Ser Arg 

510 515 520 

GTG GAG GTG ACT CGC AGG AAG GAG ATG ACC AGC CTG AGG GCG TTT GCT 163 5 

Val Glu Val Thr Arg Arg Lys Glu Met Thr Ser Leu Arg Ala Phe Ala 
525 530 535 

GTC TAC ACT TCC ATC TCC ATC TTC A1G AAC ACA GCC ATC CCC ATT GCT 16 83 

Val Tyr Thr Ser He Ser He Phe Met Asn Thr Ala He Pro lie Ala 
540 545 550 

GCC GTC CTC ATC ACC TTC GTG GGC CAC GTC AGC TTC TTC AAA GAG TCG 1731 

Ala Val Leu He Thr Phe Val Gly His Val Ser Phe Phe Lys Glu Ser 
555 560 565 

GAC TTG TCA CCC TCG GTG GCC TTT GCC TCC CTC TCT CTC TTC CAC ATC 177 9 

Asp Leu Ser Pro Ser Val Ala Phe Ala Ser Leu Ser Leu Phe His He 

570 575 580 585 

CTG GTC ACT CCA CTG TTC CTG CTG TCT AGC GTG GTT CGG TCC ACT GTC 1827 

Leu Val Thr Pro Leu Phe Leu Leu Ser Ser Val Val Arg Ser Thr Val 

590 595 600 

AAA GCC CTG GTG AGC GTG CAA AAA CTG AGC GAG TTC CTG TCT AGT GCA 187 5 

Lys Ala Leu Val Ser Val Gin Lys Leu Ser Glu Phe Leu Ser Ser Ala 
605 610 615 

GAG ATC CGT GAG GAG CAG TGT GCC CCC CGA GAG CCT GCA CCC CAA GGC 192 3 

Glu He Arg Glu Glu Gin Cys Ala Pro Arg Glu Pro Ala Pro Gin Gly 
620 625 630 

CAA GCC GGC AAG TAC CAG GCA GTG CCC CTC AAG GTT GTG AAC CGC AAA 1971 

Gin Ala Gly Lys Tyr Gin Ala Val Pro Leu Lys Val Val Asn Arg Lys 
635 640 645 

CGC CCA GCC CGG GAA GAG GTC CGG GAC CTC CTG GGC CCA CTG CAG AGG 2019 

Arg Pro Ala Arg Glu Glu Val Arg Asp Leu Leu Gly Pro Leu Gin Arg 

650 655 660 665 

CTG GCC CCT AGC ATG GAC GGG GAT GCT GAC AAC TTC TGT GTC CAG ATC 206 7 

Leu Ala Pro Ser Met Asp Gly Asp Ala Asp Asn Phe Cys Val Gin He 

670 675 680 

ATC GGA GGC TTC TTC ACC TGG ACC CCT GAT GGA ATC CCC ACT CTG TCC 2115 

He Gly Gly Phe Phe Thr Trp Thr Pro Asp Gly He Pro Thr Leu Ser 
685 690 695 

AAC ATC ACC ATC CGT ATT CCC CGA GGT CAG CTA ACC ATG ATT GTG GGG 216 3 

Asn He Thr He Arg He Pro Arg Gly Gin Leu Thr Met He Val Gly 
700 7D5 710 

CAG GTG GGC TGC GGC AAG TCC TCG CTC CTC CTC GCC ACC CTG GGG GAG 2211 

Gin Val Gly Cvs Gly Lys Ser Ser Leu Leu Leu Ala Thr Leu Gly Glu 
715 " 720 725 
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ATG CAG AAG GTG TCG GGG GCC GTC TTC TGG AAC AGC AAC CTT CCG GAC 22 5 9 

Met Gin Lys Val Ser Gly Ala Val Phe Trp Asn Ser Asn Leu Pro Asp 
730 735 740 745 

AGC GAG GGG AGA GGA CCC CAG CAG CCC AGA GCG GGA GAC AGC AGC TGG 2 3 07 

Ser Glu Gly Arg Gly Pro Gin Gin Pro Arg Ala Gly Asp Ser Ser Trp 
750 755 760 

CTC GGA TAT CAG GAG CAG AGG CCC CGT GGC TAC GCA TCT CAG AAA CCA 23 5 5 

Leu Gly Tyr Gin Glu Gin Arg Pro Arg Gly Tyr Ala Ser Gin Lys Pro 
765 770 775 

TGG CTG CTA AAC GCC ACC GTG GAA GAG AAC ATC ACC TTC GAG AGT CCC 24 03 

Trp Leu Leu Asn Ala Thr Val Glu Glu Asn He Thr Phe Glu Ser Pro 
780 785 790 

TTC AAT CCG CAG CGG TAC AAG ATG GTC ATC GAA GCC TGC TCC CTG CAG 24 51 

Phe Asn Pro Gin Arg Tyr Lys Met Val He Glu Ala Cys Ser Leu Gin 
795 800 805 

CCG GAC ATA GAC ATC CTG CCC CAC GGA GAC CAG ACT CAG ATT GGG GAA 24 99 

Pro Asp He Asp He Leu Pro His Gly Asp Gin Thr Gin He Gly Glu 
810 815 820 825 

CGG GGC ATC AAC CTG TCT GGT GGT CAG CGT CCA GAT CAG TGT GGT CCA 254 7 

Arg Gly He Asn Leu Ser Gly Gly Gin Arg Pro Asp Gin Cys Gly Pro 
830 835 840 

GAG CCC TCT ACC AGC AGA CCA ATG TTC GTC TTC TTG GAT GAC CCC TTC 2 5 95 

Glu Pro Ser Thr Ser Arg Pro Met Phe Val Phe Leu Asp Asp Pro Phe 
845 850 855 

TCA GCT TTG GAT GTC CAT CTG AGT G£C CAC CTG ATG CAG GCC GGC ATC 2 64 3 

Ser Ala Leu Asp Val His Leu Ser Asp His Leu Met Gin Ala Gly He 
860 865 870 

CTT GAG CTG CTC CGG GAT GAC AAG AGG ACA GTG GTC TTG GTG ACC CAC 2 6 91 

Leu Glu Leu Leu Arg Asp Asp Lys Arg Thr Val Val Leu Val Thr His 
875 880 885 

AAG CTA CAG TAT CTG CCT CAT GCA GAC TGG ATC ATT GCC ATG AAG GAT 273 9 

Lys Leu Gin Tyr Leu Pro His Ala Asp Trp He He Ala Met Lys Asp 
890 895 900 905 

GGG ACC ATT CAG AGG GAA GGG ACG CTC AAG GAC TTC CAG AGG TCC GAG 278 7 

Gly Thr He Gin Arg Glu Gly Thr Leu Lys Asp Phe Gin Arg Ser Glu 
910 915 920 

TGC CAG CTC TTT GAG CAC TGG AAG ACC CTC ATG AAC CGG CAG GAC CAA 28 3 5 

Cys Gin Leu Phe Glu His Trp Lys Thr Leu Met Asn Arg Gin Asp Gin 
925 930 935 

GAG CTG GAG AAG GAG ACA GTC ATG GAG AGG AAA GCC TCA GAG CCA TCT 2 8 83 

Glu Leu Glu Lys Glu Thr Val Met Glu Arg Lys Ala Ser Glu Pro Ser 
. 940 945 950 

CAG GGC CTG CCC CGT GCC ATG TCC TCC AGA GAC GGC CTT CTG CTG GAT 2 931 

Gin Gly Leu Pro Arg Ala Met Ser Ser Arg Asp Gly Leu Leu Leu Asp 
955 960 965 

GAG GAA GAG GAG GAA GAG GAG GCA GCC GAA AGC GAG GAA GAT GAC AAC 2 97 9 

Glu Glu Glu Glu Glu Glu Glu Ala Ala Glu Ser Glu Glu Asp Asp Asn 
970 975 980 985 

TTA TCT TCA GTG CTG CAT CAG CGA GCT AAG ATC CCC TGG CGA "CC TGC 3 02 7 

Leu Ser Ser Val Leu His Gin Arg Ala Lys lie Pro Trp Arg Ala Cys 
990 59? 1000 
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ACT AAG TAT CTG TCC TCT GCT GGC ATT CTG CTC CTG TCC CTG CTT GTC 3 075 

Thr Lys Tyr Leu Ser Ser Ala Gly lie Leu Leu Leu Ser Leu Leu Val 
1005 1010 1015 

TTC TCC CAG CTG CTC AAG CAC ATG GTC TTG GTG GCC ATT GAT TAT TGG 3123 
Phe Ser Gin Leu Leu Lys His Met Val Leu Val Ala lie Asp Tyr Trp 
1020 1025 1030 

CTG GCC AAG TGG ACG GAC AGT GCC CTG GTC CTG AGC CCC GCT GCC AGG 3171 
Leu Ala Lys Trp Thr Asp Ser Ala Leu Val Leu Ser Pro Ala Ala Arg 
1035 1040 1045 

AAC TGT TCG CTC AGC CAG GAA TGT GAC CTG GAC CAG TCT GTC TAT GCC 3219 
Asn Cys Ser Leu Ser Gin Glu Cys Asp Leu Asp Gin Ser Val Tyr Ala 
1050 1055 1060 1065 

ATG GTA TTC ACC TTG CTC TGC AGC CTG GGT ATC GTG CTG TGC CTG GTC 326 7 

Met Val Phe Thr Leu Leu Cys Ser Leu Gly lie Val Leu Cys Leu Val 
1070 1075 1080 

ACC TCT GTC ACT GTG GAG TGG ACG GGA CTG AAG GTG GCC AAG AGG CTA 3315 
Thr Ser Val Thr Val Glu Trp Thr Gly Leu Lys Val Ala Lys Arg Leu 
1085 1090 1095 

CAC CGC AGC CTG CTC AAC CGC ATC ATC CTG GCC CCC ATG AGG TTC TTT 336 3 

His Arg Ser Leu Leu Asn Arg lie He Leu Ala Pro Met Arg Phe Phe 
1100 1105 1110 

GAG ACC ACA CCC CTC GGG AGT ATC CTG AAC AGA TTT TCA TCC GAC TGT 3411 
Glu Thr Thr Pro Leu Gly Ser lie Leu Asn Arg Phe Ser Ser Asp Cys 
1115 1120 1125 

AAC ACC ATT GAC CAG CAC ATC CCA TCC ACG CTG GAG TGT CTG AGC CGG 3 45 9 

Asn Thr He Asp Gin His He Pro Ser Thr Leu Glu Cys Leu Ser Arg 
1130 1135 1140 1145 

TCC ACC CTG CTG TGT GTC TCC GCC CTG ACT GTC ATC TCC TAT GTC ACA 3 50 7 

Ser Thr Leu Leu Cys Val Ser Ala Leu Thr Val He Ser Tyr Val Thr 
1150 1155 1160 

CCC GTG TTC CTC GTG GCC CTC TTA CCC CTA GCT GTT GTG TGC TAG TTC 3 55 5 

Pro Val Phe Leu Val Ala Leu Leu Pro Leu Ala Val Val Cys Tyr Phe 
1165 1170 1175 

ATT CAG AAG TAC TTC CGA GTG GCA TCC AGG GAC CTG CAG CAG CTG GAC 3 603 

He Gin Lys Tyr Phe Arg Val Ala Ser Arg Asp Leu Gin Gin Leu Asp 
1180 1185 1190 

GAC ACG ACG CAC4 CTC CCG CTC GTC TCA CAC TTT GCT GAA ACT GTG GAG 3 6 51 

Asp Thr Thr Gin Leu Pro Leu Val Ser His Phe Ala Glu Thr Val Glu 
1195 1200 1205 

GGA CTC ACC ACC ATC CGT GCC TTC AGG TAC GAG GCC CGG TTC CAG CAG 36 9 9 

Gly Leu Thr Thr He Arg Ala Phe Arg Tyr Glu Ala Arg Phe Gin Gin 
1210 1215 1220 1225 

AAG CTT CTA GAA TAT ACC GAC TCC AAC AAC ATC GCC TCC CTC TTC CTC 3 74 7 

Lys Leu Leu Glu Tyr Thr Asp Ser Asn Asn lie Ala Ser Leu Phe Leu 
1230 1235 1240 

ACG GCA GCC AAC AGA TGG CTG GAA GTC TGC ATG GAG TAC ATC GGA GCG 3 7 95 

Thr Ala Ala Asn Arg Trp Leu Glu Val Cys Met Glu Tyr He Gly Ala 
1245 1250 1255 

TGC GTG GTA CTC ATT GCG GCT GCC ACC TCC ATC TCC AAC TCC CTG CAC 3 8 43 

Cys Val Val Leu lie Ala Ala Ala Thr Ser He Ser Asn Ser Leu His 
1260 1265 1270 



SUBSTITUTE SHEET [RULE 26) 



BMSDOC1D: <WO 952641 1A1 J_> 



WO 95/28411 



PCT/US95/04463 



112 

AGO GAA CTT TCT GCT GGC CTG GTG GGC CTG GGC CTC ACC TAT GCC TTG 3 8 91 

Arg Glu Leu Ser Ala Gly Leu Val Gly Leu Gly Leu Thr Tyr Ala Leu 
1275 1280 1285 

ATG GTC TCC AAC TAC CTC AAC TGG ATG GTG AGG AAC CTG GCG GAC ATG 3 93 9 

Met Val Ser Asn Tyr Leu Asn Trp Met Val Arg Asn Leu Ala Asp Met 
1290 1295 1300 1305 

GAG ATC CAG CTG GGG GCT GTG AAG AGG ATC CAC GCA CTC CTG AAA ACC 3 98 7 

Glu He Gin Leu Gly Ala Val Lys Arg He His Ala Leu Leu Lys Thr 
1310 1315 1320 

GAG GCG GAG AGC TAT GAG GGG CTC CTG GCG CCG TCG TTG ATC CCC AAG 4035 
Glu Ala Glu Ser Tyr Glu Gly Leu Leu Ala Pro Ser Leu He Pro Lys 
1325 1330 1335 

AAC TGG CCA GAC CAA GGG AAG ATC CAA ATT CAG AAC CTG AGC GTG CGC 4 083 

Asn Trp Pro Asp Gin Gly Lys He Gin lie Gin Asn Leu Ser Val Arg 
1340 1345 1350 

TAT GAC AGC TCC CTG AAG CCA GTG CTG AAG CAT GTC AAC ACC CTC ATC 4131 
Tyr Asp Ser Ser Leu Lys Pro Val Leu Lys His Val Asn Thr Leu He 
1355 1360 1365 

TCC CCG GGG CAG AAG ATC GGG ATC TGC GGC CGC ACA GGC AGC GGG AAG 417 9 

Ser Pro Gly Gin Lys He Gly lie Cys Gly Arg Thr Gly Ser Gly Lys 
1370 1375 1380 1385 

TCC TCC TTC TCC CTG GCC TTT TTC CGA ATG GTG GAC ATG TTT GAA GGA 42 2 7 

Ser Ser Phe Ser Leu Ala Phe Phe Arg Met Val Asp Met Phe Glu Gly 
1390 1395 1400 

CGC ATC ATC ATT GAT GGC ATC GAC ATC GCC AAG CTG CCA CTT CAC ACG 42 7 5 

Arg He He He Asp Gly He Asp He Ala Lys Leu Pro Leu His Thr 
1405 1410 1415 

CTG CGC TCA CGC CTG TCC ATC ATC CTA CAG GAC CCC GTC CTC TTC AGC 4323 
Leu Arg Ser Arg Leu Ser He He Leu Gin Asp Pro Val Leu Phe Ser 
1420 1425 1430 

GGC ACG ATC AGA TTC AAC CTG GAC CCC GAG AAG AAA TGC TCA GAC AGC 4 3 71 

Gly Thr lie Arg Phe Asn Leu Asp Pro Glu Lys Lys Cys Ser Asp Ser 
1435 1440 1445 

ACA CTG TGG GAG GCC CTG GAG ATC GCC CAG CTG AAG CTG GTA GTG AAG 4419 
Thr Leu Trp Glu Ala Leu Glu lie Ala Gin Leu Lys Leu Val Val Lys 
1450 1455 1460 1465 

GCA CTG CCA GGA GGC CTA GAT GCC ATC ATC ACA GAA GGA GGG GAG AAT 44 6 7 

Ala Leu Pro Gly Gly Leu Asp Ala He He Thr Glu Gly Gly Glu Asn 
1470 1475 1480 

TTT AGC CAG GGC CAG AGG CAG CTG TTC TGC CTG GCC CGG GCC TTC GTG 4 515 

Phe Ser Gin Gly Gin Arg Gin Leu Phe Cys Leu Ala Arg Ala Phe Val 
1485 1490 1495 

AGG AAG ACC AGC ATC TTC ATC ATG GAT GAA GCA ACC GCC TCC ATC GAC 4 56 3 

Arg Lys Thr Ser lie Phe He Met Asp Glu Ala Thr Ala Ser He Asp 
1500 1505 1510 

ATG GCT ACG GAG AAC ATC CTC CAG AAG GTG GTG ATG ACA GCC TTC GCA 4611 
Met Ala Thr Glu Asn lie Leu Gin Lys Val Val Met Thr Ala Phe Ala 
1515 1520 1525 

GAC CGC ACG GTG GTC ACC ATC GCG CAT CGT GTG CAC ACC ATC CTG AGT 46 5 9 

Asp Arg Thr Val Val Thr He Ala His Arg Val His Thr lie Leu Ser 
1530 1535 1540 1545 
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GCA GAC CTG GTG ATG GTC CTC AAG AGG GGT GCT ATC CTG GAG TTT GAC 4*7 07 

Ala Asp Leu Val Met Val Leu Lys Arg Gly Ala lie Leu Glu Phe Asp 
1550 1555 1560 

AAG CCA GAG ACG CTC CTC AGC CAG AAG GAC AGC GTG TTC GCC TCC TTT 4 755 

Lys Pro Glu Thr Leu Leu Ser Gin Lys Asp Ser Val Phe Ala Ser Phe 
1565 1570 1575 

GTC CGT GCG GAC AAG TGACTTACCG GAGCCAAAGT GCCACCCCGC GCCTCGCTTG 4810 
Val Arg Ala Asp Lys 
1580 

CTTGCCTAGG ATTTCTAACT GCAAATCACT TG T AAAT AAA TTAATTCTTT GCTAAAAAAA 48 7 0 

AAAAAAA 4877 

(36) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1582 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Pro Leu Ala Phe Cys Gly Thr Glu Asn His Ser Ala Ala Tyr Arg 
15 10 15 

Val Asp Gin Gly Val Leu Asn Asn Gly Cys Phe Val Asp Ala Leu Asn 
20 25 30 

Val Val Pro His Val Phe Leu Leu Phe lie Thr Phe Pro lie Leu Phe 
35 40 45 

He Gly Trp Gly Ser Gin Ser Ser Lys Val His He His His Ser Thr 
50 55 60 

Trp Leu His Phe Pro Gly His Asn Leu Arg Trp He Leu Thr Phe He 
65 70 75 80 

Leu Leu Phe Val Leu Val Cys Glu He Ala Glu Gly He Leu Ser Asp 
85 90 95 

Gly Val Thr Glu Ser Arg His Leu His Leu Tyr Met Pro Ala Gly Met 
100 105 110 

Ala Phe Met Ala Ala He Thr Ser Val Val Tyr Tyr His Asn lie Glu 
115 120 125 

Thr Ser Asn Phe Pro Lys Leu Leu He Ala Leu Leu He Tyr Trp Thr 
130 135 140 

Leu Ala Phe He Thr Lys Thr lie Lys Phe Val Lys Phe Tyr Asp His 
145 150 155 160 

Ala lie Glv Phe Ser Gin Leu Arg Phe Cys Leu Thr Gly Leu Leu Val 
165 170 175 

He Leu Tyr Gly Met Leu Leu Leu Val Glu Val Asn Val lie Arg Val 
180 185 190 

Arg Arg Tvr He Phe Phe Lys Thr Pro Arg Glu Val Lys Pro Pro Glu 
195 ' 200 205 
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Asp Leu Gin Asp Leu Gly Val Arg Phe Leu Gin Pro Phe Val Asn Leu 
210 215 220 

Leu Ser Lys Gly Thr Tyr Trp Trp Met Asn Ala Phe lie Lys Thr Ala 
225 230 23S 240 

His Lys Lys Pro He Asp Leu Arg Ala He Ala Lys Leu Pro He Ala 
245 250 255 

Met Arg Ala Leu Thr Asn Tyr Gin Arg Leu Cys Val Ala Phe Asp Ala 
260 265 270 

Gin Ala Arg Lys Asp Thr Gin Ser Pro Gin Gly Ala Arg Ala He Trp 
275 280 285 

Arg Ala Leu Cys His Ala Phe Gly Arg Arg Leu He Leu Ser Ser Thr 
290 295 300 

Phe Arg He Leu Ala Asp Leu Leu Gly Phe Ala Gly Pro Leu Cys He 
305 310 315 320 

Phe Gly He Val Asp His Leu Gly Lys Glu Asn Kis Val Phe Gin Pro 
325 330 335 

Lys Thr Gin Phe Leu Gly Val Tyr Phe Val Ser Ser Gin Glu Phe Leu 
340 345 350 

Gly Asn Ala Tyr Val Leu Ala Val Leu Leu Phe Leu Ala Leu Leu Leu 
355 360 365 

Gin Arg Thr Phe Leu Gin Ala Ser Tyr Tyr Val Ala He Glu Thr Gly 
370 375 380 

He Asn Leu Arg Gly Ala He Gin Thr Lys He Tyr Asn Lys He Met 
385 390 395 400 

His Met Ser Thr Ser Asn Leu Ser Met Gly Glu Met Thr Ala Gly Gin 
405 410 415 

He Cys Asn Leu Val Ala He Asp Thr Asn Gin Leu Met Trp Phe Phe 
420 425 430 

Phe Leu Cys Pro Asn Leu Trp Thr Met Pro Val Gin He lie Val Gly 
435 440 445 

Val He Leu Leu Tyr Tyr lie Leu Gly Val Ser Ala Leu lie Gly Ala 
450 455 460 

Ala Val He He Leu Leu Ala Pro Val Gin Tyr Phe Val Ala Thr Lys 
465 470 475 480 

Leu Ser Gin Ala Gin Arg Thr Thr Leu Glu His Ser Asn Glu Arg Leu 
485 490 495 

Lys Gin Thr Asn Glu Met Leu Arg Gly Met Lys Leu Leu Lys Leu Tyr 
500 505 510 

Ala Trp Glu Ser He Phe Cys Ser Arg Val Glu Val Thr Arg Arg Lys 
515 520 525 

Glu Met Thr Ser Leu Arg Ala Phe Ala Val Tyr Thr Ser lie Ser He 
530 535 540 

Phe Met Asn Thr Ala He Pro lie Ala Ala Val Leu He Thr Phe Val 
545 550 555 560 

Gly His Val Ser Phe Phe Lys Glu Ser Asp Leu Ser Pro Ser Val Ala 
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565 570 575 

Phe Ala Ser Leu Ser Leu Phe His lie Leu Val Thr Pro Leu Phe Leu 
580 565 590 

Leu Ser Ser Val Val Arg Ser Thr Val Lys Ala Leu Val Ser Val Gin 
595 600 605 

Lys Leu Ser Glu Phe Leu Ser Ser Ala Glu lie Arg Glu Glu Gin Cys 
610 615 620 

Ala Pro Arg Glu Pro Ala Pro Gin Gly Gin Ala Gly Lys Tyr Gin Ala 
625 630 635 640 

Val Pro Leu Lys Val Val Asn Arg Lys Arg pro Ala Arg Glu Glu Val 
645 650 655 

Arg Asp Leu Leu Gly Pro Leu Gin Arg Leu Ala Pro Ser Met Asp Gly 
660 665 670 

Asp Ala Asp Asn Phe Cys Val Gin lie lie Gly Gly Phe Phe Thr Trp 
675 660 685 

Thr Pro Asp Gly lie Pro Thr Leu Ser Asn lie Thr lie Arg lie Pro 
690 695 700 

Arg Gly Gin Leu Thr Met He Val Gly Gin Val Gly Cys Gly Lys Ser 
705 710 715 720 

Ser Leu Leu Leu Ala Thr Leu Gly Glu Met Gin Lys Val Ser Gly Ala 
725 730 735 

Val Phe Trp Asn Ser Asn Leu Pro Asp Ser Glu Gly Arg Gly Pro Gin 
740 745 750 

Gin Pro Arg Ala Gly Asp Ser Ser Trp Leu Gly Tyr Gin Glu Gin Arg 
755 760 765 

Pro Arg Gly Tyr Ala Ser Gin Lys Pro Trp Leu Leu Asn Ala Thr Val 
770 775 780 

Glu Glu Asn He Thr Phe Glu Ser Pro Phe Asn Pro Gin Arg Tyr Lys 
785 790 795 800 

Met Val He Glu Ala Cys Ser Leu Gin Pro Asp He Asp He Leu Pro 
805 810 815 

His Gly Asp Gin Thr Gin He Gly Glu Arg Gly He Asn Leu Ser Gly 
820 825 830 

Gly Gin Arg Pro Asp Gin Cys Gly Pro Glu Pro Ser Thr Ser Arg Pro 
835 840 845 

Met Phe Val Phe Leu Asp Asp Pro Phe Ser Ala Leu Asp Val His Leu 
850 855 860 

Ser Asp His Leu Met Gin Ala Gly He Leu Glu Leu Leu Arg Asp Asp 
865 870 875 880 

Lys Arg Thr Val Val Leu Val Thr His Lys Leu Gin Tyr Leu Pro His 
885 890 895 

Ala Asd Trp He lie Ala Met Lys Asp Gly Thr He Gin Arg Glu Gly 
900 905 910 

Thr Leu Lvs Asp Phe Gin Arg Ser Glu Cys Gin Leu Phe Glu His Trp 
915 920 925 
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Lys Thr Leu Met Asn Arg Gin Asp Gin Glu Leu Glu Lys Glu Thr Val 
930 935 940 

Met Glu Arg Lys Ala Ser Glu Pro Ser Gin Gly Leu Pro Arg Ala Met 
945 950 955 960 

Ser Ser Arg Asp Gly Leu Leu Leu Asp Glu Glu Glu Glu Glu Glu Glu 
965 970 975 

Ala Ala Glu Ser Glu Glu Asp Asp Asn Leu Ser Ser Val Leu His Gin 
980 985 990 

Arg Ala Lys He Pro Trp Arg Ala Cys Thr Lys Tyr Leu Ser Ser Ala 
995 1000 1005 

Gly He Leu Leu Leu Ser Leu Leu Val Phe Ser Gin Leu Leu Lys His 
1010 1015 1020 

Met Val Leu Val Ala He Asp Tyr Trp Leu Ala Lys Trp Thr Asp Ser 
1025 1030 1035 1040 

Ala Leu Val Leu Ser Pro Ala Ala Arg Asn Cys Ser Leu Ser Gin Glu 
1045 1050 1055 

Cys Asp Leu Asp Gin Ser Val Tyr Ala Met Val Phe Thr Leu Leu Cys 
1060 1065 1070 

Ser Leu Gly He Val Leu Cys Leu Val Thr Ser Val Thr Val Glu Trp 
1075 1080 1085 

Thr Gly Leu Lys Val Ala Lys Arg Leu His Arg Ser Leu Leu Asn Arg 
1090 1095 1100 

He He Leu Ala Pro Met Arg Phe Phe Glu Thr Thr Pro Leu Gly Ser 
1105 1110 1115 1120 

He Leu Asn Arg Phe Ser Ser Asp Cys Asn Thr He Asp Gin His He 
1125 1130 1135 

Pro Ser Thr Leu Glu Cys Leu Ser Arg Ser Thr Leu Leu Cys Val Ser 
1140 1145 1150 

Ala Leu Thr Val He Ser Tyr Val Thr Pro Val Phe Leu Val Ala Leu 
1155 1160 1165 

Leu Pro Leu Ala Val Val Cys Tyr Phe He Gin Lys Tyr Phe Arg Val 
1170 1175 1180 

Ala Ser Arg Asp Leu Gin Gin Leu Asp Asp Thr Thr Gin Leu Pro Leu 
1185 1190 1195 1200 

Val Ser His Phe Ala Glu Thr Val Glu Gly Leu Thr Thr He Arg Ala 
1205 12lo 1215 

Phe Arg Tyr Glu Ala Arg Phe Gin Gin Lys Leu Leu Glu Tyr Thr Asp 
1220 1225 1230 

Ser Asn Asn He Ala Ser Leu Phe Leu Thr Ala Ala Asn Arg Trp Leu 
1235 1240 1245 

Glu Val Cys Met Glu Tyr He Gly Ala Cys Val Val Leu He Ala Ala 
1250 ' 1255 * * 1260 

Ala Thr Ser He Ser Asn Ser Leu His Arg Glu Leu Ser Ala Gly Leu 
1265 1270 1275 1280 

Val Gly Leu Gly Leu Thr Tyr Ala Leu Met Val Ser Asn Tyr Leu Asn 
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1285 1290 1295 

Trp Met Val Arg Asn Leu Ala Asp Met Glu He Gin Leu Gly Ala Val 
1300 1305 1310 

Lys Arg He His Ala Leu Leu Lys Thr Glu Ala Glu Ser Tyr Glu Gly 
1315 1320 1325 

Leu Leu Ala Pro Ser Leu He Pro Lys Asn Trp Pro Asp Gin Gly Lys 
1330 1335 1340 

He Gin He Gin Asn Leu Ser Val Arg Tyr Asp Ser Ser Leu Lys Pro 
1345 1350 1355 1360 

Val Leu Lys His Val Asn Thr Leu He Ser Pro Gly Gin Lys He Gly 
1365 1370 1375 

lie Cys Gly Arg Thr Gly Ser Gly Lys Ser Ser Phe Ser Leu Ala Phe 
1380 1385 1390 

Phe Arg Met Val Asp Met Phe Glu Gly Arg He He He Asp Gly He 
1395 1400 1405 

Asp He Ala Lys Leu Pro Leu His Thr Leu Arg Ser Arg Leu Ser He 
1410 1415 1420 

He Leu Gin Asp Pro Val Leu Phe Ser Gly Thr He Arg Phe Asn Leu 
1425 1430 1435 1440 

Asp Pro Glu Lys Lys Cys Ser Asp Ser Thr Leu Trp Glu Ala Leu Glu 
1445 1450 1455 

He Ala Gin Leu Lys Leu Val Val Lys Ala Leu Pro Gly Gly Leu Asp 
1460 1465 1470 

Ala He He Thr Glu Gly Gly Glu Asn Phe Ser Gin Gly Gin Arg Gin 
1475 1480 1485 

Leu Phe Cys Leu Ala Arg Ala Phe Val Arg Lys Thr Ser lie Phe He 
1490 1495 1500 

Met Asp Glu Ala Thr Ala Ser He Asp Met Ala Thr Glu Asn He Leu 
1505 1510 1515 1520 

Gin Lys Val Val Met Thr Ala Phe Ala Asp Arg Thr Val Val Thr He 
1525 1530 153S 

Ala His Arg Val His Thr He Leu Ser Ala Asp Leu Val Met Val Leu 
1540 1545 1550 

Lys Arg Gly Ala lie Leu Glu Phe Asp Lys Pro Glu Thr Leu Leu Ser 
1555 1560 1565 

Gin Lys Asp Ser Val Phe Ala Ser phe Val Arg Ala Asp Lys 
1570 1575 1580 
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WHAT IS CLAIMED IS: 

1. A sulfonylurea receptor encoded by the nucleic 
acid sequence of SEQ ID NO: 1. 

2. The sulfonylurea receptor of claim 1 selected 
5 from the group consisting of mouse, rat, and hamster 

sulfonylurea receptor. 

3. A sulfonylurea receptor encoded by the nucleic 
acid sequence selected from the group consisting of SEQ ID 
NO: 4 and SEQ ID NO: 6. 

10 4 . The sulfonylurea receptor of claim 3 encoded by 

the nucleic acid sequence of SEQ ID NO: 4. 

5. The sulfonylurea receptor of claim 4 further 
comprising SEQ ID NO: 2. 

6. The sulfonylurea receptor of claim 3 encoded by 
15 the nucleic acid sequence of SEQ ID NO: 6. 

7. The sulfonylurea receptor of claim 4 further 
comprising SEQ ID NO: 3. 

8 • A sulfonylurea receptor encoded by the amino 
acid sequence selected from the group consisting of SEQ ID 
20 NO: 5 and SEQ ID NO: 7. 

9. A purified nucleic acid sequence encoding a 

. sulfonylurea receptor selected from the group consisting of 
SEQ ID NO: 1, SEQ ID NO: 4, and SEQ ID NO: 6, 

10. A purified amino acid sequence encoding a 

25 sulfonylurea receptor selected from the group consisting of 
SEQ ID NO: 5 and SEQ ID NO: 7. 
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11 • An expression vector comprising a nucleic acid 
sequence selected from the group consisting of SEQ ID NO: 1, 
SEQ ID NO: 4, and SEQ ID NO: 6* 

12. A host cell capable of expressing the nucleic 
5 acid sequence selected from the group consisting of SEQ ID 

NO: 1, SEQ ID NO: 4, and SEQ ID NO: 6. 

13. A cell culture capable of expressing a 
sulfonylurea receptor encoded by the amino acid sequence 
selected from the group consisting of SEQ ID NO: 5 and SEQ ID 

10 NO: 7. 

14. A protein preparation comprising a 
sulfonylurea receptor encoded by the amino acid sequence 
selected from the group consisting of SEQ ID NO: 5 and SEQ ID 
NO: 7. 

15 15. A transgenic non-human mammal all of whose 

germ cells and somatic sells contain a recombinant sequence 
encoding a sulfonylurea receptor introduced into said mammal, 
or an ancestor of said mammal. 

16. The mammal of claim 15 wherein said sequence 
2 0 encoding a sulfonylurea receptor is selected from the group 

consisting of SEQ ID NO: 1, 4, and 6. 

17. A monoclonal antibody capable of binding to a 
sequence encoding a sulfonylurea receptor. 

18. The monoclonal antibody of claim 17 wherein 
25 said sequence encoding a sulfonylurea receptor is selected 

from the group consisting of SEQ ID NO: 5 and 7. 

19. A method of detecting persistent 
hyperinsulinemic hypoglycemia of infancy comprising 
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obtaining a sample comprising patient nucleic acids 
from a patient tissue sample; 

amplifying sulfonylurea receptor specific nucleic 
acids from said patient nucleic acids to produce a test 
5 fragment; 

obtaining a sample comprising control nucleic acids 
from a control tissue sample; 

amplifying control nucleic acids encoding wild type 
sulfonylurea receptor to produce a control fragment; 
10 comparing the test fragment with the control 

fragment to detect the presence of a sequence difference in 
the test fragment, wherein a difference in said test fragment 
indicates persistent hyperinsulinemic hypoglycemia of 
infancy . 



15 20, The method of claim 19 wherein a sequence 

difference is selected from the group consisting of a nucleic 
acid transition and a restriction digest pattern alteration. 

21. The method of claim 19 wherein a sequence 
difference is a nucleic acid transition. 



20 22. The method of claim 21 wherein said nucleic 

acid transition is a G to A transition at nucleic acid 
position 750 of SEQ ID NO: 26. 

23. The method of claim 21 wherein a nucleic acid 
transition results at position 27 of SEQ ID NO: 29. 

25 24. The method of claim 19 wherein a sequence 

difference is a restriction digest pattern alteration. 

25. The method of claim 24 wherein said 
restriction digest pattern alteration comprises test 
fragments of about 304 base pairs and about 123 base pairs 
3 0 upon digestion with MspJ. 
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26. The method of claim 24 wherein said 
restriction digest pattern alteration comprises a test 
fragment of about 14 6 base pairs upon digestion with Neil. 



27. The method of claim 19 wherein said 
5 amplification step comprises performing the polymerase chain 
reaction. 



28. The method of claim 27 wherein the polymerase 
chain reaction comprises using a pair of primers, wherein one 
primer within said pair is selected from the group consisting 
10 of SEQ ID NOS: 16-24. 



29* The method of claim 27 wherein said polymerase 
chain reaction comprises the use of two primers, a first 
primer selected from the group consisting of SEQUENCE ID NOS: 
17, 18, 21, and 23, and a second primer selected from the 
15 group consisting of SEQUENCE ID NOS: 16, 19, 20, 22, and 24. 

30. The method of claim 19 wherein said tissue 
sample is selected from the group consisting of pancreatic 
tissue, blood, serum, saliva, sputum, mucus, bone marrow, 
urine, lymph, and a tear* 



2 0 31. A method of detecting persistent 

hyperinsulinemic hypoglycemia of infancy comprising 

obtaining a sample comprising patient mRNA from a 
patient tissue sample; 

reverse transcribing said mRNA into cDNA to produce 
.25 patient cDNA; 

amplifying sulfonylurea receptor specific cDNA from 
said patient cDNA to produce a test fragment; 

obtaining a sample comprising control nucleic acids 
from a control tissue sample; 
30 amplifying control DNA encoding wild type 

sulfonylurea receptor to produce a control fragment; 
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digesting said test fragment and said control 
fragment with MspX; 

comparing the test fragment with the control 
fragment to detect test fragments having about 304 base pairs 
5 and about 12 3 base pairs, wherein said test fragments 

indicate persistent hyperinsulinemic hypoglycemia of infancy* 



• 32. A method of detecting persistent 
hyperinsulinemic hypoglycemia of infancy comprising 

obtaining a sample comprising patient genomic DNA 
10 from a patient tissue sample; 

amplifying sulfonylurea receptor specific DNA from 
said patient genomic DNA to produce a test fragment; 

obtaining a sample comprising control nucleic acids 
from a control tissue sample; 
15 amplifying control DNA encoding wild type 

sulfonylurea receptor to produce a control fragment; 

comparing the test fragment with the control 
fragment to detect a test fragment having G to A transition 
at nucleic acid position 750 of SEQ ID NO: 26, wherein said 
20 test fragment indicates persistent hyperinsulinemic 
hypoglycemia of infancy. 

33. A method of detecting persistent 
hyperinsulinemic hypoglycemia of infancy comprising 

obtaining a sample comprising patient mRNA from a 
25 patient tissue sample; 

reverse transcribing said mRNA into cDNA to produce 
patient cDNA; 

amplifying sulfonylurea receptor specific cDNA from 
said patient cDNA to produce a test fragment; 
3 0 obtaining a sample comprising control nucleic acids 

from a control tissue sample; 

amplifying control DNA encoding wild type 
sulfonylurea receptor to produce a control fragment; 

digesting said test fragment and said control 
3 5 fragment with Neil; and 
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comparing the test fragment with the control 
fragment to detect a test fragment of about 146 base pairs 
wherein said test fragment indicates persistent 
hyper insulinemic hypoglycemia of infancy. 

5 34. A method of claim 32, 33, or 34, wherein said 

amplification step comprises performing the polymerase chain 
reaction. 

35. A sulfonylurea receptor encoded by the nucleic 
acid sequence of SEQ ID NO: 26. 

10 36, The sulfonylurea receptor of claim 3 5 which is 

a human sulfonylurea receptor. 

37. A sulfonylurea receptor encoded by the amino 
acid sequence of SEQ ID NO: 28. 

38. A purified nucleic acid sequence encoding a 
15 sulfonylurea receptor comprising the sequence of SEQ ID NO: 

26. 

39. A purified amino acid sequence encoding a 
sulfonylurea receptor comprising the sequence of SEQ ID NO: 
28. 

20 40. An expression vector comprising a nucleic acid 

sequence of SEQ ID NO: 26. 

41. A host cell capable of expressing the nucleic 
acid sequence of SEQ ID NO: 26. 

42. A cell culture capable of expressing a 

25 sulfonylurea receptor encoded by the amino acid sequence of 
SEQ ID NO: 28. 
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43. A protein preparation comprising a 
sulfonylurea receptor encoded by the amino acid sequence of 
SEQ ID NO: 28. 



44. A transgenic non-human mammal having all or 
5 part of its own sulfonylurea receptor sequence suppressed. 

45. A transgenic non-human mammal comprising a 
recombinant wild type sequence for all or part of the 
sulfonylurea receptor sequence* 

46. The transgenic non-human mammal of claim 45 

10 wherein said sulfonylurea receptor sequence is SEQ ID NO: 26, 

47. A monoclonal antibody capable of binding to a 
sequence encoding a sulfonylurea receptor, wherein said 
sequence encoding a sulfonylurea receptor is SEQ ID NO: 28. 



48. A diagnostic kit for detecting persistent 
15 hyperinsulinemic hypoglycemia of infancy comprising in one or 
more containers a pair of primers , wherein one primer within 
said pair is complementary to a region of the sulfonylurea 
receptor, wherein one of said pair of primers is selected 
from the group consisting of SEQ ID NOS: 16-24, a probe 
20 specific to the amplified product, and a means for 

visualizing amplified DNA, and optionally including one or 
more size markers, and positive and negative controls. 



49. The diagnostic kit of claim 48 wherein said 
means for visualizing amplified DNA is selected from the 

25 group consisting of fluorescent stain, 32 P, and biotin. 

50. The sequence encoded by a SEQ ID NO selected 
from the group consisting of SEQ ID NO: 16, SEQ ID NO: 17, 
SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, 
SEQ ID NO: 22, SEQ ID NO: 23, and SEQ ID NO: 24. 

30 
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